ABSTRACT

The Australian Square Kilometre Array Pathfinder (ASKAP) will
revolutionise our knowledge of gas-rich galaxies in the Universe. Here we
present predictions for two proposed extragalactic ASKAP neutral hydrogen
(HI) emission-line surveys, based on semi-analytic models applied to
cosmological N-body simulations. The ASKAP HI All-Sky Survey, known as
WALLABY, is a shallow 3π survey (z = 0 to 0.26) which will probe the mass
and dynamics of over 6 × 10^5 galaxies. A much deeper small-area HI survey,
called DINGO, aims to trace the evolution of HI from z = 0 to 0.43, a
cosmological volume of 4 × 10^7 Mpc^3, detecting potentially 10^5 galaxies.
The high-sensitivity 30 antenna ASKAP core (diameter ~2 km) will provide an
angular resolution of 30 arcsec (at z = 0). Our simulations show that the
majority of galaxies detected in WALLABY (87.5%) will be resolved. About
5000 galaxies will be well resolved, i.e. more than five beams (2.5 arcmin)
across the major axis, enabling kinematic studies of their gaseous disks.
This number would rise to 1.6 × 10^5 galaxies if all 36 ASKAP antennas
could be used; the additional six antennas provide baselines up to 6 km,
resulting in an angular resolution of 10 arcsec. For DINGO this increased
resolution is highly desirable to minimise source confusion, reducing
confusion rates from a maximum of 10% of sources at the survey edge to 3%.
We estimate that the sources detected by WALLABY and DINGO will span four
orders of magnitude in total halo mass (from 10^11 to 10^15 M⊙) and nearly
seven orders of magnitude in stellar mass (from 10^5 to 10^12 M⊙), allowing
us to investigate the process of galaxy formation across the last four
billion years.

Key words: galaxies: evolution - galaxies: luminosity function, mass func- 
tion - radio lines: galaxies - methods: N-body simulations 



1 INTRODUCTION 

Neutral hydrogen (HI) is a ubiquitous tracer of large-scale
structure in the Universe. It allows us to study the physical
and dynamical processes within galaxies, including the
kinematic properties of structures such as bars, disks and
warps. Each galaxy HI spectrum provides a large set of galaxy
properties, for example the systemic velocity, the integrated
flux density and the velocity width. These are used to derive
the galaxy distance, its gas mass and its total dynamical
mass, respectively. The gas mass is also a good indicator of
ongoing star formation.

The evolution of HI is of fundamental importance to
understanding the build-up of both the stellar and gas masses
within galaxies as well as the method by which galaxies
accrete their material. Due to the inherent signal weakness of
the 21-cm hyperfine splitting transition, the detection of HI
emission in distant galaxies requires high-resolution and
high-sensitivity observations. It is a crucial window into
galaxy formation over time; while we wait for the next
generation of large-scale HI surveys, we will explore their
potential via N-body simulations.

In Table 1 we present a non-exhaustive list of large-area HI
surveys that have either been completed or are currently
ongoing, as well as several future surveys of note.

Survey      Sky area [deg^2] (resolution)   Velocity range [km s^-1]   Detections   Reference
HIPASS^1    29343 (15.5')                   -1280 < cz < 12700         ~5300        Barnes et al. (2001)
HIJASS      1100 (12')                      -1280 < cz < 12700         222          Lang et al. (2003)
ALFALFA^2   7000 (4')                       -1600 < cz < 18000         ~15000       Haynes et al. (2011)
EBHIS       21400 (9')                      cz < 21000                 Underway     Kerp et al. (2011)
FAST^3      4000 (3')                       z < 0.4                    2 × 10^6     Duffy et al. (2008)
SKA^4       20000 (12")                     z < 1.5                    10^9         Abdalla & Rawlings (2005)

Table 1. We summarise here the key large-area HI surveys that have been completed, or are near completion, as well
as future projects. ^1 We have included several catalogues: the HIPASS Bright Galaxy Catalogue (Koribalski et al. 2004),
the southern hemisphere HI catalogue (Meyer et al. 2004) and its northern extension (δ < 25°; Wong et al. 2009). ^2 For
ALFALFA detections we quote the values from the 40% catalogue, which at the time of writing was the most complete
published result. ^3 The survey considered here consisted of 600 s integration times on source, lasting a year. ^4 The SKA as
considered in Abdalla & Rawlings (2005) would contain 50% of the collecting area within 5 km and we therefore use this to
determine the typical resolution, although in reality baselines may extend to 3 orders of magnitude larger.

Several large-scale HI surveys were obtained with the 64-m
Parkes telescope, made possible by the innovative 21-cm
multibeam system which consists of 13 dual-polarisation feed
horns and a powerful correlator (Staveley-Smith et al. 1996).
Most prominent among them is the HI Parkes All Sky Survey
(HIPASS; Barnes et al. 2001). We also include the (unfinished)
Northern counterpart, the HI Jodrell All Sky Survey (HIJASS;
Lang et al. 2003), which utilised a 4-beam receiver. We list
two major surveys currently underway in Table 1. The first is
the Arecibo Legacy Fast ALFA survey (ALFALFA), which utilises
the Arecibo L-Band Feed Array (ALFA) 7-beam receiver on the
305-m Arecibo dish. The second is the Effelsberg-Bonn HI
Survey (EBHIS; Kerp et al. 2011), carried out with the
recently installed 7-beam system on the 100-m dish. Finally we
present two future surveys in the table: the Chinese-built
Five-hundred metre Aperture Spherical Telescope (FAST; Nan
2006), which utilises a 19-beam receiver, and the Square
Kilometre Array (SKA^1).

There are three precursor instruments to the SKA: the
Murchison Widefield Array (MWA; Lonsdale et al. 2009), the
Meer-Karoo Array Telescope (MeerKAT; Booth et al. 2009) and
the Australian SKA Pathfinder (ASKAP; Johnston et al. 2008;
Deboer et al. 2009).

Here we will focus on planned HI surveys with ASKAP, which is
currently under construction at the Murchison Radio Astronomy
Observatory in Western Australia. ASKAP will consist of 36
antennas (12-m diameter), 30 of which are located within a
2-km diameter circle. ASKAP's large field of view of 30 square
degrees, provided by novel phased array feeds (Chippendale et
al.), makes ASKAP a 21-cm survey machine.

In the following sections we introduce two ASKAP HI surveys:
the shallow HI All-Sky Survey (known as WALLABY; Koribalski et
al. 2009), and the deep, but small-area, HI survey (known as
DINGO; Meyer et al. 2009).



1.1 WALLABY 

The Widefield ASKAP L-band Legacy All-sky Blind surveY
(WALLABY^2) is a large project led by Bärbel Koribalski and
Lister Staveley-Smith. WALLABY proposes to observe ~75% of the
sky (-90° < δ < +30°) out to a redshift of z = 0.26. In the
WALLABY proposal (Koribalski et al. 2009) it is estimated that
500,000 galaxies will be detected over the full survey area
(assuming an angular resolution of 30"); of these, ~1000
galaxies will be spatially well resolved (i.e. >5' in angular
extent, or >10 beams).
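The quoted sky coverage follows directly from the declination limits. As a quick check (a minimal sketch; the function and variable names are illustrative and not from the survey proposal), the fraction of the sphere between two declinations is (sin δ_max - sin δ_min)/2:

```python
import math

# Area of the full celestial sphere in square degrees (about 41253 deg^2).
FULL_SKY_DEG2 = 4.0 * math.pi * (180.0 / math.pi) ** 2


def sky_fraction(dec_min_deg, dec_max_deg):
    """Fraction of the celestial sphere lying between two declinations."""
    lo = math.sin(math.radians(dec_min_deg))
    hi = math.sin(math.radians(dec_max_deg))
    return (hi - lo) / 2.0


# WALLABY declination range quoted in the text: -90 deg < dec < +30 deg.
wallaby_fraction = sky_fraction(-90.0, 30.0)
wallaby_area = wallaby_fraction * FULL_SKY_DEG2
# Whatever WALLABY cannot reach is left for a Northern survey.
northern_area = FULL_SKY_DEG2 - wallaby_area

print(f"WALLABY: {wallaby_fraction:.0%} of the sky, {wallaby_area:.0f} deg^2")
print(f"Remaining northern cap: {northern_area:.0f} deg^2")
```

The two areas agree with the "Sky Obs" entries listed for WALLABY and WNSHS in Table 3.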

To achieve a much higher angular resolution of 10" the full
ASKAP configuration (36 antennas with baselines up to 6 km) is
needed. Given the high computational cost of spectral line
imaging of large volumes at such high resolution we are
considering obtaining "postage stamps". These are
high-resolution mini-cubes formed at the position and velocity
of particularly interesting galaxies determined a priori. This is
possible because the 21-cm data are collected by the 
full array and will serve many survey science projects. 
For simplicity in this work we assume that all galaxies 
will be imaged using the 6 km baselines as we are also 
interested in probing the issue of resolving out objects 
and hence studying what fraction of galaxies become 
non-detected. 

The WALLABY goals, outlined in detail in the ASKAP Survey
Science Proposal (Koribalski et al. 2009), are to examine the
properties, environment and
large-scale distribution of gas-rich galaxies. In sum- 
mary, WALLABY will study galaxy formation and the 
missing satellite problem in the Local Group, evolution 
and star-formation in galaxies, mergers and interac- 
tions in galaxies, the Hi mass function and its varia- 
tion with local environment, processes governing the 
evolution and distribution of cool gas at low redshift, 
and the nature of the cosmic web. 

WALLABY will also be able to investigate cosmological
parameters. For example, we will be able to measure the matter
power spectrum in the local Universe. Furthermore, we should
be able to constrain the equation of state of Dark Energy to
better than 20% (Duffy et al. 2012b). Other tests that have
been proposed for such a large HI survey are studying the
coherent bulk flows of galaxies on large scales (e.g. Burkey &
Taylor 2004; Abate et al. 2008), the measurement of Baryonic
Acoustic Oscillations and the Hubble constant (Beutler et al.
2011), the measurement of the rms mass fluctuations, σ8, and
growth factor, f (Beutler et al. 2012), and the surface
brightness dimming of objects with intrinsic brightness, the
so-called Tolman test (Khedekar & Chakraborti 2011).

1 SKA homepage: www.skatelescope.org
2 WALLABY homepage: www.atnf.csiro.au/research/WALLABY

In this paper we also consider a proposed Northern Hemisphere
HI survey that, when combined with WALLABY, will provide a
true HI all-sky survey. A proposal has been submitted to the
Astron Westerbork Synthesis Radio Telescope facility to carry
out the Westerbork Northern Sky HI Survey (WNSHS^3). This
project is led by Gyula Jozsa, and will target the Northern
sky using new phased-array feeds that can instantaneously
observe 8 deg^2. WNSHS as proposed will likely be a slightly
deeper survey than WALLABY, with a resolution similar to the
full 6 km baseline of ASKAP. The science goals for WNSHS are
similar and, because it fills in the sky coverage missed by
ASKAP, entirely complementary to WALLABY. As currently
designed there are two survey modes for WNSHS. The first is a
limited integration time of 4 hours per observation, which
would match the expected sensitivity of the WALLABY survey.
The second is a deeper, 12 hour pointing, which would lower
the flux limits by a factor of 1.7 and open up a much deeper
exploration (more in line with the deep ASKAP survey, DINGO,
discussed later). We consider both survey modes, with a 4 hour
WNSHS model + WALLABY, termed 'ALL SKY', and the deeper WNSHS
case on its own. In this study we will show that the ALL SKY
combination of WALLABY and the 4 (12) hour pointing WNSHS
could potentially detect of order 800,000 (> 10^6) galaxies.



1.2 DINGO 

The Deep Investigation of Neutral Gas Origins (DINGO^4) survey
is led by Martin Meyer and consists of deep and ultra-deep
phases which differ in area and depth. In the first phase, the
survey proposes to target five non-contiguous fields, 150
deg^2 in total, out to z = 0.26. While the redshift range is
the same as for WALLABY, the integration per field is 500
hours (> 60× longer), providing 8× better sensitivity. Where
feasible the target fields will be selected to overlap with
the Galaxy And Mass Assembly (GAMA; Driver et al. 2009)
survey. The second phase is proposed to consist of two
ultra-deep fields, 60 deg^2 in total, over the redshift range
z = 0.1 to 0.43. Very long integration times with 2500 hrs per
field will enable DINGO UDEEP to probe the evolution of HI
over the last 4 billion years of cosmic time.

3 WNSHS homepage: http://www.astron.nl/~jozsa/wnshs
4 DINGO homepage: http://www.physics.uwa.edu.au/~mmeyer/dingo

© 2012 RAS, MNRAS 000

DINGO is designed to probe the HI universe out to the maximum
redshift possible, enabling the evolution in key cosmological
parameters (such as the cosmic HI density) and the HI mass
function to be measured. It will also sample a sufficient
volume to measure the two-point correlation function and the
halo occupation distribution function as a function of
redshift. Overlaps with GAMA fields will provide matching
stellar properties for the DINGO HI detections. For the HI
non-detections in these fields, the optical redshifts from
GAMA make HI spectral stacking possible, extending the
effective limit of DINGO.
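The sensitivity gains quoted for DINGO and WNSHS are consistent with the standard radiometer scaling, in which the rms noise falls as 1/sqrt(bandwidth × integration time). The sketch below is only a consistency check, not the flux-limit calculation of Duffy et al. (2012b). The 8 hour WALLABY integration is an assumption inferred from the "> 60× longer" statement, the velocity-to-frequency conversion assumes z = 0, and all names are illustrative.

```python
import math

C_KM_S = 299792.458        # speed of light [km/s]
NU_HI_HZ = 1420.40575e6    # rest frequency of the 21-cm line [Hz]


def noise_gain(t_ref_hr, t_new_hr, bw_ref_hz=1.0, bw_new_hz=1.0):
    """Factor by which the rms noise drops, assuming sigma ~ 1/sqrt(bandwidth * time)."""
    return math.sqrt((t_new_hr * bw_new_hz) / (t_ref_hr * bw_ref_hz))


T_WALLABY_HR = 8.0  # ASSUMPTION: inferred from 500 hr being "> 60x longer"

deep_gain = noise_gain(T_WALLABY_HR, 500.0)     # DINGO DEEP vs WALLABY
udeep_gain = noise_gain(T_WALLABY_HR, 2500.0)   # DINGO UDEEP vs WALLABY
wnshs_gain = noise_gain(4.0, 12.0)              # 12 hr vs 4 hr WNSHS pointing

# A 3.86 km/s channel expressed in Hz at z = 0, then widened to 100 kHz.
channel_hz = NU_HI_HZ * 3.86 / C_KM_S
bandwidth_gain = noise_gain(1.0, 1.0, channel_hz, 100e3)

print(f"DINGO DEEP  gain: {deep_gain:.2f}")
print(f"DINGO UDEEP gain: {udeep_gain:.2f}")
print(f"WNSHS 12 hr gain: {wnshs_gain:.2f}")
print(f"3.86 km/s channel = {channel_hz / 1e3:.1f} kHz; "
      f"gain at 100 kHz: {bandwidth_gain:.2f}")
```

Under these assumptions the factors come out close to the quoted 8× (DINGO DEEP) and 1.7× (12 hour WNSHS), and they also reproduce the ratios between the noise levels listed per survey and per bandwidth in Table 3.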



1.3 ASKAP Hi Predictions 

Estimating the performance of the proposed ASKAP surveys,
WALLABY and DINGO, is a challenging theoretical problem due to
both the large cosmic volumes probed by these surveys (3.26
and 0.04 Gpc^3 respectively) and the low detection mass
threshold (~10^8 M⊙).
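The survey volumes quoted above can be approximately recovered from the survey areas and redshift ranges. The sketch below assumes a flat ΛCDM cosmology with the Millennium Simulation parameters given in Section 2 (Ω_m = 0.25, Ω_Λ = 0.75, h = 0.73) and expresses volumes in Gpc^3 rather than h^-3 Gpc^3. The helper names are illustrative and this is not the calculation used to produce the published numbers.

```python
import math

C_KM_S = 299792.458
H0 = 73.0                      # km/s/Mpc, i.e. h = 0.73
OMEGA_M, OMEGA_L = 0.25, 0.75
FULL_SKY_DEG2 = 4.0 * math.pi * (180.0 / math.pi) ** 2


def comoving_distance(z, steps=1000):
    """Line-of-sight comoving distance [Mpc] in flat LCDM (composite Simpson's rule)."""
    def inv_e(x):
        return 1.0 / math.sqrt(OMEGA_M * (1.0 + x) ** 3 + OMEGA_L)

    h = z / steps  # steps must be even for Simpson's rule
    total = inv_e(0.0) + inv_e(z)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * inv_e(i * h)
    return (C_KM_S / H0) * total * h / 3.0


def survey_volume_gpc3(area_deg2, z_min, z_max):
    """Comoving volume [Gpc^3] of a survey covering area_deg2 between two redshifts."""
    shell = (4.0 * math.pi / 3.0) * (
        comoving_distance(z_max) ** 3 - comoving_distance(z_min) ** 3
    )
    return (area_deg2 / FULL_SKY_DEG2) * shell / 1e9


wallaby_volume = survey_volume_gpc3(30940.0, 0.0, 0.26)
dingo_volume = (survey_volume_gpc3(150.0, 0.0, 0.26)      # DEEP fields
                + survey_volume_gpc3(60.0, 0.1, 0.43))    # UDEEP fields

print(f"WALLABY: {wallaby_volume:.2f} Gpc^3")
print(f"DINGO:   {dingo_volume:.3f} Gpc^3")
```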

Previous work on simulating HI in cosmological simulations
using fully hydrodynamical simulations (e.g. Popping et al.
2009; Altay et al. 2011; Duffy et al. 2012a) has been limited
to smaller volumes, (< 100 h^-1 Mpc)^3, and is hence
unsuitable for making accurate predictions of the large
variety of structure found with ASKAP. Instead we can use
N-body simulations, which are computationally cheaper to run
and hence can simulate larger regions of the Universe.
Creating galaxy properties in simulations that do not
explicitly track the gas is called semi-analytic modelling.
The low computational cost of running a galaxy formation model
atop an existing N-body simulation is such that the various
parameters in the model can be tuned to successfully recreate
numerous observational constraints.

Several recent attempts have been made to split the cold gas
from the semi-analytic model into atomic and molecular
hydrogen components (e.g. Obreschkow & Rawlings 2009a; Power
et al. 2010; Lagos et al. 2011). Using the observational
constraint, from Blitz & Rosolowsky (2006) and Leroy et al.
(2008), that the molecular-to-atomic ratio H2/HI depends on
the local interstellar medium pressure, Obreschkow & Rawlings
(2009a) found that the HI mass function does not evolve
strongly until z > 1. Using a similar methodology Power et al.
(2010) found that this conclusion holds for several
semi-analytic models. Lagos et al. (2011) differs slightly in
that they calculate the H2/HI ratio in the semi-analytic model
itself and form stars from the molecular component alone. The
non-evolving prediction of the HI mass function holds as
before, with some interesting new predictions at low HI masses
< 10^8 h^-1 M⊙ which are below the typical thresholds for the
galaxies found in ASKAP but which the SKA might detect in
large numbers.

In this study we seek a simple recipe to guarantee that the HI
masses in the simulation are similar to those observed
locally. Hence, rather than adopting one of the previously
mentioned techniques, we argue for a more transparent method.
One possibility could be to randomly assign HI masses based on
the locally observed HI mass function (e.g. Zwaan et al. 2005;
Martin et al. 2010) to dark matter haloes in an existing
N-body simulation. However, there are several advantages in
using the galaxy properties from semi-analytic modelling.
First, confusion estimates based on realistic clustering of
galaxies become possible. In addition we can probe the effects
of modest evolution in the HI abundance of the galaxies as
their cold gas mass changes in the simulation. Finally, one
can examine the galaxy stellar properties from the catalogue
with the HI selected sample.

To these ends we make use of a semi-analytic catalogue of
galaxies to form a mock lightcone for ASKAP as detailed in
Section 2, with the method by which we create HI gas masses
for each galaxy in the catalogue given in Section 2.1. We
consider the sensitivity of a radio telescope in Section 2.3
and, later in Section 2, the impact that realistic spatial and
spectral features for the simulated galaxies will have when
observed by instruments such as ASKAP. In Sections 3.1 and 3.2
we detail the anticipated number counts for WALLABY and DINGO
respectively, along with the ability of ASKAP to probe large
distributions of galaxy properties in such a flux-limited
sample. The issue of confusion of HI sources is considered in
Section 4.1 for the particular case of DINGO. We consider the
problems of identifying optical counterparts to the undetected
HI sources to enable the stacking of their spectra in Section
4.3. We compare the HI surveys in Section 5. Finally we
conclude in Section 6 and emphasise the need for zoom-in,
high-resolution images around galaxy detections, so-called
'postage stamps', to limit the effects of confusion on source
counts as well as guarantee a large sample of resolved
galaxies that can be modelled using their velocity field
information.



2 METHODOLOGY

The galaxy catalogue is created using the semi-analytic model
(SAM) of Croton et al. (2006) to produce galaxies based on the
underlying dark-matter-only Millennium Simulation (Springel et
al. 2005). This sample of galaxies accurately recreates the
observed stellar mass function with a combination of
Supernovae feedback and, crucially to this model, feedback at
the high mass end from active galactic nuclei. The cosmology
used in the Millennium Simulation is (Ω_m = 0.25, Ω_Λ = 0.75,
Ω_b = 0.045, σ8 = 0.9, h = 0.73). This differs from the
current best fitting cosmological model based on measurements
of Supernovae and the Cosmic Microwave Background (Komatsu et
al. 2011), most significantly in the relatively high σ8 value
adopted in the simulation. This will have the effect of
creating haloes that, at a fixed total mass of 10^11 (10^14)
h^-1 M⊙, are ~30% (20%) too concentrated (Duffy et al. 2008).
We ignore the effects of baryons on the Dark Matter halo, the
baryonic backreaction, which typically increases the
concentration; with sufficient feedback (e.g. by an accreting
Supermassive Blackhole), however, gas can be expelled and
actually reduce the concentration relative to the Dark Matter
only simulation (e.g. Duffy et al. 2010). At all masses the
haloes will be ~5% too spherical but with identical spin
distributions (Macciò et al. 2008) relative to the 'true'
cosmology.

We then utilised the Theoretical Astrophysical Observatory
(TAO^5), a cloud-based web application which produces mock
catalogues from different cosmological simulations and galaxy
models in the form of a lightcone (detailed in Bernyk et al.
in preparation), to create an all-sky galaxy catalogue
extending to z = 0.26 for the WALLABY survey and to z = 0.43
for the narrower but deeper DINGO survey. Although the DINGO
surveys are not contiguous fields (they are placed around the
sky to lessen cosmic variance) we make the simulated lightcone
a single field of the same total area; this will not have any
impact on the expected galaxy counts. This tool creates a
lightcone through the simulation volume, stitching a randomly
rotated cube to the far edge to minimise the repeated
structure in a cone. Objects that are broken up by this
rotation of the cube are separately reattached to preserve the
1- and 2-halo terms. The closest snapshot in redshift from the
full simulation catalogue is used where possible to accurately
trace the evolution of galaxies. This resource provides
stellar, dark matter and cold gas masses (amongst many other
quantities) for all galaxies within the lightcone.

We use the flux limit calculations from Duffy et al. (2012b)
and figures for ASKAP given in Table 2 for several HI surveys.
In Table 3 (and 4) we provide survey parameters and summarise
the results of our simulations. The respective surveys are
WALLABY (as described in Section 1.1), All-Sky (WALLABY plus a
Northern hemisphere extension with the same resolution and
sensitivity, essentially the 4 hour WNSHS survey), WNSHS (the
12 hour deep survey with 1.7× better sensitivity than WALLABY)
and finally DINGO DEEP and UDEEP (as given in Section 1.2).



2.1 Estimating HI in Galaxies

The Croton et al. (2006) model produces a list of cold gas
masses for the galaxies that we can attempt to break into
neutral atomic, molecular hydrogen and ionised hydrogen
components located near to the galaxy. At the high mass end we
would envisage significant fractions of neutral hydrogen to be
in molecular form (Leroy et al. 2008; Saintonge et al. 2011),
with the low mass end predominantly atomic hydrogen with a
component of ionised hydrogen due to the ionising cosmic UV
background (e.g. Quinn et al. 1996). The cold gas - HI mass
fraction has been previously studied in semi-analytic works as
both a constant, e.g. Power et al. (2010) found 0.54 after
considering He abundances and typical ionisation fractions



5 http://tao.it.swin.edu.au/mock-galaxy-factory
Parameter               ASKAP core   ASKAP full
Number of dishes        30           36
A_eff (m^2)             2669         3211
T_sys (K)               50           50
Max Baseline (km)       2            6
Ang Res (z=0)           30"          10"
Field-Of-View (deg^2)   30           30



Table 2. We summarise here the strawman values of
ASKAP (Johnston et al. 2008) as it is likely to be
operational for the first several years of its spectral-line
use, namely a reduced baseline model 'core' which utilises the
inner 30 dishes across a maximum 2 km baseline rather than
'full' with 36 dishes, and 6 km maximum baseline, which
will be used for higher-resolution observations. Ultimately,
one may be able to use the full 36 dishes of ASKAP using
short baseline pairs only such that the 30" resolution is
attained for all dishes. In this case, however, the number of
pairs is only marginally increased and therefore our
sensitivity calculation with 30 dishes is still valid; for
simplicity we assume only 30 dishes are used for the 30"
observation. Furthermore, it is not clear what weighting scheme
will be employed by ASKAP for the high resolution, 6 km,
baselines and hence whether the full 20% improvement in flux
sensitivity will be realised (for the case of natural
weighting). To that end we conservatively assume that both the
low and high resolution modes have the same flux levels. The
effective area assumes an aperture efficiency of 80% and
assumes the use of cross-correlations only (see text). Uniquely
with ASKAP the Field-of-View is independent of frequency, and
hence across the depth of the survey, with the increasing
Field-of-View as the observed wavelength lengthens balanced by
modifying the number of beams formed on the sky.
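The effective areas in Table 2 can be approximately recovered from the dish count, the 12-m dish diameter and the 80% aperture efficiency. The sketch below assumes that "cross-correlations only" enters by replacing the number of dishes N with sqrt(N(N - 1)). That factor is an assumption made here for illustration, since the caption does not give the formula, and it matches the tabulated values only to within about one square metre.

```python
import math


def effective_area(n_dishes, dish_diameter_m=12.0, aperture_efficiency=0.8):
    """Effective collecting area [m^2] of an array using cross-correlations only.

    ASSUMPTION: dropping the autocorrelations replaces N by sqrt(N * (N - 1)).
    """
    single_dish = math.pi * (dish_diameter_m / 2.0) ** 2
    cross_only = math.sqrt(n_dishes * (n_dishes - 1))
    return aperture_efficiency * single_dish * cross_only


core_area = effective_area(30)   # ASKAP core
full_area = effective_area(36)   # full ASKAP

print(f"core: {core_area:.0f} m^2, full: {full_area:.0f} m^2")
print(f"full / core = {full_area / core_area:.2f}")
```

The ratio of the two areas is about 1.2, which corresponds to the 20% sensitivity difference between the full and core arrays mentioned in the caption.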



of the gas, as well as a function of the halo properties
themselves (e.g. Obreschkow & Rawlings 2009b). As we are
interested in creating a realistic mock catalogue rather than
understanding the HI properties of galaxies we apply a
reduction ratio R = M_HI / M_cold to convert cold gas to HI so
as to recover the observed HI mass function^6. We find that a
functional form suggested by Yang et al. (2003) for creating a
conditional luminosity function by assigning stellar mass to
dark matter haloes is particularly suitable for this
conversion




where the fraction of HI in the cold gas is
(M_HI / M_cold) = 0.41 at a characteristic cold gas mass
M* = 8.8 × 10^10 M⊙ with faint end slope α = 0.52 and
bright end slope β = 0.56 for a sample of galaxies in
the redshift range 0.02 < z < 0.07 centred on the mean


6 In this work we take the ALFALFA mass function (Martin et
al. 2010) to model the galaxies on but note that this
typically results in more HI rich galaxies than if we used
HIPASS (Zwaan et al. 2005). The latter mass function results
in 16% fewer galaxies, for example, in the DINGO DEEP survey.




[Figure 1: mass functions; horizontal axis log10(M [M⊙]) from 7 to 12.]

Figure 1. Here we present several mass functions to
indicate the methodology of obtaining a 'realistic' observed
HI mass per halo. We start with the cold gas mass function
from the Croton et al. (2006) semi-analytic model (grey
squares) and compare with the observed HI mass function
from ALFALFA (or equivalently HIPASS) in the blue (red)
solid (hatched) line (Martin et al. 2010; Zwaan et al. 2005,
respectively). We then apply a cold gas - HI conversion in
the form of a broken power law function, given in Eqn. 1,
before recalculating the mass function and once again
comparing to the ALFALFA observations. This is repeated
using an AMOEBA code to minimise the difference, resulting
in the black triangles which are our final HI mass function.
Note that we adopted a cold gas mass cut of 10^8.5 M⊙ as
the resolution limit, in accordance with Power et al. (2010),
which is why the grey squares abruptly end at low mass.
The error bars are Poissonian (with the highest mass bins
containing only two or three objects).



redshift of the WALLABY survey z ~ 0.05 (as calculated in
Duffy et al. 2012a). We refer to the survey predictions using
this conversion scheme as Fixed R. Interestingly, the
normalisation of the fraction for the low redshift sample is
similar to the value suggested by Power et al. (2010).

As Power et al. (2010) demonstrated, the cosmic density of
cold gas within semi-analytic models typically increased by
0.2 dex over the redshift range z = 0 to 0.43 and hence we can
expect that our predictions with Fixed R will demonstrate mild
evolution in the resultant HI mass function. However, we can
also refit the mass function in several redshift slices such
that we recover a non-evolving Ω_HI; we have calculated these,
fairly arbitrary, conversions for the higher redshift sample
in DINGO UDEEP. We call this conversion the Fixed Ω_HI model.
We include the different predictions for the Fixed R / Ω_HI
cases in Table 3. However, in all plots and discussions in the
text we will consider only the latter case as it is both the
most conservative in terms of galaxies detected and in better
agreement with the data, which typically show little evolution
in cosmic HI density to z = 3 (e.g. Rao et al. 2006; Lah et
al. 2007; Noterdaeme et al. 2009; Prochaska & Wolfe 2009;
Meiring et al. 2011). As shown in Fig. 1 this method creates
an HI mass function in excellent agreement with the data.
Parameter                  WALLABY          ALL SKY          WNSHS            DINGO DEEP     DINGO UDEEP
                                                                                             (Ω_HI fixed, R fixed)
Sky Obs [deg^2]            30940            41252            10313            150            60
Survey [hrs]               …                …                …                2500           5000
Redshift range             0 - 0.26         0 - 0.26         0 - 0.26         0 - 0.26       0.1 - 0.43

1σ noise [mJy km s^-1]
  Δv = 3.86 km s^-1        1.592            1.592            0.937            0.201          0.090
  Δν = 100 kHz             0.681            0.681            0.401            0.086          0.039

D_HI [kpc]                 14 ± 3           14 ± 3           14 ± 3           14 ± 3         15 ±…, 15 ±…
Detected D_HI [kpc]        42 ±…            42 ±…            42 ±…            41 ±…          41 ±…, 42 ±…

Low Res
> 1 beam   [N_gal (%)]     542706 (87.5)    726346 (87.5)    327454 (62.4)    4926 (9.0)     759 (1.4), 855 (1.4)
> 3 beams  [N_gal (%)]     30859 (5.0)      41397 (5.0)      11557 (2.2)      152 (0.3)      0 (0), 0 (0)
> 5 beams  [N_gal (%)]     6205 (1.0)       8394 (1.0)       2416 (0.5)       38 (0.07)      0 (0), 0 (0)
> 10 beams [N_gal (%)]     699 (0.1)        946 (0.1)        271 (0.05)       2 (0.004)      0 (0), 0 (0)
> 20 beams [N_gal (%)]     74 (0.01)        109 (0.01)       37 (0.01)        0 (0)          0 (0), 0 (0)

High Res
> 1 beam   [N_gal (%)]     247038 (100)     323156 (100)     356447 (100)     44904 (94.8)   37018 (75.3), 41490 (76.5)
> 3 beams  [N_gal (%)]     239653 (97.0)    310109 (96.0)    153328 (43.0)    4926 (10.4)    759 (1.5), 855 (1.6)
> 5 beams  [N_gal (%)]     146280 (59.2)    183811 (56.9)    31759 (8.9)      955 (2.0)      47 (0.10), 37 (0.07)
> 10 beams [N_gal (%)]     22149 (9.0)      26310 (8.1)      3761 (1.1)       115 (0.2)      0 (0), 0 (0)
> 20 beams [N_gal (%)]     2476 (1.0)       2868 (0.9)       432 (0.1)        10 (0.02)      0 (0), 0 (0)



Table 3. We summarise here the proposed survey-specific values for the two aspects of DINGO (DEEP and Ultra-DEEP),
WALLABY and WNSHS as described in the text. For Ultra-DEEP we consider two evolution models: a varying HI - cold
gas fraction such that Ω_HI remains constant and a fixed HI - cold gas fraction that results in a modest increase in Ω_HI (the
latter given in bold face). For reference we include the 1σ rms noise levels in each survey for two bandwidth measures of
3.86 km s^-1 and 100 kHz. Note that with the final decision on the baseline weighting still to be made, the high resolution
mode is conservatively set to the same flux limits as the low resolution case (therefore the number of resolved systems is a
lower limit). We list the median and quartile limits of the HI diameter of the galaxies in each survey, along with the value
for those galaxies detected (in the low resolution survey only). We consider what fraction of these systems are resolved, i.e.
have an angular diameter greater than the angular resolution of the telescope (the low res value is 30"). These are then
separated into the expected number of galaxies that will be resolved by more than 1, 3, 5, 10 or 20 beams (in parentheses
are the percentages of detected galaxies in each case). We then consider a postage-stamp re-imaging around each of these
detections at higher resolution, using the 10" resolution available with the full 6 km baseline of ASKAP. For the case of WNSHS
the highest resolution imaging is 13". This extra resolution makes a significant difference to the final number counts of well
resolved systems and argues for a high resolution re-imaging for strong galaxy detections. The survey number counts are
presented in Table 4.
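To illustrate how a physical HI diameter translates into a number of beams, the sketch below converts a diameter in kpc to an angular size using the angular diameter distance in the Millennium cosmology (Ω_m = 0.25, Ω_Λ = 0.75, h = 0.73). It assumes that the beam quoted at z = 0 broadens as (1 + z) with the observed wavelength. The helper names are illustrative and this is not the procedure used to populate Table 3.

```python
import math

C_KM_S = 299792.458
H0 = 73.0                      # km/s/Mpc, i.e. h = 0.73
OMEGA_M, OMEGA_L = 0.25, 0.75
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def comoving_distance(z, steps=1000):
    """Line-of-sight comoving distance [Mpc] in flat LCDM (composite Simpson's rule)."""
    def inv_e(x):
        return 1.0 / math.sqrt(OMEGA_M * (1.0 + x) ** 3 + OMEGA_L)

    h = z / steps  # steps must be even for Simpson's rule
    total = inv_e(0.0) + inv_e(z)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * inv_e(i * h)
    return (C_KM_S / H0) * total * h / 3.0


def angular_size_arcsec(diameter_kpc, z):
    """Angular size of an object of given physical diameter at redshift z."""
    d_a_mpc = comoving_distance(z) / (1.0 + z)   # angular diameter distance
    return (diameter_kpc / 1000.0) / d_a_mpc * ARCSEC_PER_RAD


def beams_across(diameter_kpc, z, beam_z0_arcsec):
    """Number of beams across the object; ASSUMES the beam scales as (1 + z)."""
    return angular_size_arcsec(diameter_kpc, z) / (beam_z0_arcsec * (1.0 + z))


# Median detected HI diameter (42 kpc) at a redshift of 0.054.
theta = angular_size_arcsec(42.0, 0.054)
low_res = beams_across(42.0, 0.054, 30.0)
high_res = beams_across(42.0, 0.054, 10.0)

print(f"angular size: {theta:.1f} arcsec")
print(f"beams across: {low_res:.1f} (30 arcsec), {high_res:.1f} (10 arcsec)")
```

Under these assumptions a typical detected galaxy is only marginally resolved by the 30" beam but spans several 10" beams, in line with the large increase in well resolved systems between the low and high resolution rows.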



This simple, yet effective, method produces a complex HI -
halo mass distribution that we examine in Fig. 2. In the top
panel we consider the histogram distribution of the HI - halo
mass ratio, finding that the majority of systems have an HI
mass that is approximately 1% of the total halo mass. This
distribution drops off sharply to higher ratios, with a
strongly asymmetric tail to lower ratios which extends to
extremely HI-poor systems. If we consider which systems are,
relatively, the most devoid of HI we find (from the bottom
panel of Fig. 2) that they are the most massive systems in the
simulation, in agreement with observations (e.g. Kilborn et
al. 2009).



2.2 Resolution Limits of the Simulation 

For such low HI detection limits as proposed in the
DINGO and WALLABY surveys there may be a potential
issue with the resolution limits of the semi-analytic
catalogue itself. This is because we have conservatively
set the minimum HI mass of the catalogue
to be the same as the cold gas mass limit, i.e. $10^{8.5}\,M_\odot$.
We can estimate the number of sources we
may miss by using an observed HI mass function and
extrapolating below this mass limit. To that end we
use the empirical technique presented in Duffy et al.
(2012b), which includes realistic rotation widths and
HI disk sizes for the galaxies, to determine what fraction
of galaxies would have been detected below the
limiting HI mass cut for the proposed ASKAP surveys
presented here.

Using empirical relations from Duffy et al.
(2012b) we find that the DINGO DEEP survey would
detect 64859 galaxies, of which 2871 have an
angular extent greater than 30". When we impose a
limiting HI mass of $10^{8.5}\,M_\odot$ we find that 60314 galaxies
are detected, of which 2785 are larger in angular
extent than 30". This means that by constraining
the semi-analytic catalogue to a limiting HI mass of
$10^{8.5}\,M_\odot$ we underestimate the DINGO DEEP survey
by approximately 4500 galaxies, or nearly 8%. Of
these missed objects the majority are unresolved, with
only 1% having an angular extent greater than 30".

S/N        WALLABY          ALL SKY          WNSHS            DINGO DEEP     DINGO UDEEP
                                                                             (Ω_HI fixed, R fixed)

 5  N_gal  620123 (738121)  829843 (988145)  524921 (583976)  54697 (55679)  53896 (54544), 59408 (60092)
    <z>    0.054 (0.059)    0.054 (0.059)    0.071 (0.075)    0.130 (0.131)  0.217 (0.218), 0.220 (0.221)

 6  N_gal  462923 (558084)  619174 (746740)  397288 (450584)  43614 (44690)  41126 (41720), 45446 (46145)
    <z>    0.049 (0.053)    0.049 (0.053)    0.065 (0.069)    0.123 (0.125)  0.208 (0.209), 0.211 (0.212)

 7  N_gal  359394 (455654)  480476 (609595)  312469 (361622)  35600 (36652)  32514 (33111), 36079 (36763)
    <z>    0.045 (0.050)    0.045 (0.050)    0.060 (0.065)    0.118 (0.119)  0.201 (0.202), 0.203 (0.204)

 8  N_gal  287390 (376405)  384325 (503236)  253222 (298782)  29693 (30663)  26556 (27104), 29337 (29955)
    <z>    0.041 (0.047)    0.041 (0.047)    0.056 (0.061)    0.112 (0.114)  0.194 (0.196), 0.196 (0.198)

 9  N_gal  235502 (317191)  314858 (424281)  209555 (252324)  25386 (26377)  21838 (22389), 24411 (25000)
    <z>    0.038 (0.045)    0.038 (0.045)    0.052 (0.058)    0.107 (0.109)  0.189 (0.190), 0.191 (0.192)

10  N_gal  196119 (272341)  262328 (364399)  176959 (216630)  21980 (22933)  18407 (18934), 20654 (21183)
    <z>    0.036 (0.043)    0.036 (0.043)    0.049 (0.055)    0.103 (0.105)  0.184 (0.186), 0.186 (0.187)

15  N_gal  94767 (151174)   127064 (202187)  89993 (120789)   12402 (13235)  9027 (9443), 10142 (10658)
    <z>    0.028 (0.035)    0.029 (0.035)    0.039 (0.045)    0.087 (0.090)  0.167 (0.170), 0.167 (0.170)

20  N_gal  54688 (99285)    73349 (132934)   54558 (79653)    8230 (8977)    4997 (5422), 5868 (6282)
    <z>    0.023 (0.031)    0.023 (0.031)    0.032 (0.040)    0.077 (0.080)  0.157 (0.160), 0.156 (0.159)

25  N_gal  35031 (71613)    47015 (95879)    36585 (57474)    5941 (6620)    3090 (3416), 3611 (3981)
    <z>    0.020 (0.028)    0.020 (0.028)    0.028 (0.035)    0.069 (0.073)  0.150 (0.152), 0.148 (0.152)

30  N_gal  23808 (54753)    32070 (73334)    25874 (44165)    4576 (5188)    2049 (2334), 2427 (2713)
    <z>    0.018 (0.025)    0.018 (0.025)    0.025 (0.032)    0.063 (0.068)  0.144 (0.147), 0.143 (0.146)

Table 4. We summarise here the proposed survey specific values for the two aspects of DINGO (DEEP and Ultra-DEEP) and 
WALLABY (Koribalski et al. 2009) as well as a Northern extension plus WALLABY ('ALL SKY') and the WNSHS survey, 
similar to Table 3. For each signal-to-noise cut we present two numbers for the predicted galaxy counts, N_gal, and their mean 
redshift, <z>, reflecting the effects of including the reduction of signal-to-noise by spatially resolved galaxies, as demonstrated 
in Fig. 3. The bracketed values ignore this effect and therefore have a larger galaxy number count. The second pair of numbers 
for DINGO UDEEP is our evolutionary model. The HI mass function is allowed to evolve by retaining the same HI - cold gas 
mass conversion as for z = 0 but applied to the high redshift cold gas mass function, which increases with redshift in agreement 
with Power et al. (2010). There is more cold gas at high redshift, and hence more HI, which results in the ultra-deep survey 
outperforming the wider angle, lower redshift DINGO survey in terms of number of galaxies detected.

For the low redshift WALLABY survey the full
catalogue with no mass cut is estimated to contain 672660
galaxies, 471129 of which are resolved by the 2 km
baseline mode of ASKAP. When a limiting HI mass of
$10^{8.5}\,M_\odot$ is assumed, the galaxy count drops to 631409.
Of these objects, 462495 (522981) are resolved by an
angular beam of 30". In other words we are potentially
underestimating the WALLABY catalogue by
40k galaxies, or 7.5%, similar to the case of DINGO
DEEP, but nearly a quarter of these galaxies would be
marginally resolved.

The predictions for the high redshift DINGO
UDEEP catalogue are essentially unchanged by setting
the limiting HI mass to $10^{8.5}\,M_\odot$. Therefore we caution
the reader that the catalogue created in this work will
likely underpredict the lower redshift DINGO DEEP
and WALLABY surveys by 8%, with the vast majority
of these systems being unresolved point sources for the
case of DINGO but as many as a quarter of this missing
population marginally resolved in the WALLABY
survey. With these caveats in mind we now consider
the manner in which we can use the semi-analytic
catalogue to create a realistic HI survey.



2.3 Creating HI surveys

The monochromatic luminosity, $L_\nu$, from a cloud of
HI (of mass $M_{\rm HI}$) with a 21-cm emission line
profile $\phi(\nu)$ (a narrow function of unit area, where
the frequency $\nu$ is close to the rest frequency, $\nu_{12} =
1.42040575$ GHz) is

$$ L_\nu = \frac{3}{4}\, h \nu_{12} A_{12}\, \frac{M_{\rm HI}}{m_{\rm H}}\, \phi(\nu) , \qquad (2) $$

where $A_{12}$ is the Einstein coefficient for the HI spin-flip
transition. We can relate this to the total integrated
flux of the source in terms of the bolometric luminosity,
$S_{\rm tot} = L_{\rm tot}/(4\pi d_L^2)$ (Eqn 3.89 from Peacock
1999), at a luminosity distance $d_L = D(1 + z)$, where
$D$ is the comoving distance and $L_{\rm tot} = \int L_\nu\, d\nu$. The
total flux is the flux density, $S_\nu$, integrated over
frequency:

$$ \int S_\nu\, d\nu = \frac{3}{16\pi}\, \frac{h \nu_{12} A_{12}}{d_L^2(z)}\, \frac{M_{\rm HI}}{m_{\rm H}} \int \phi(\nu)\, d\nu . \qquad (3) $$

This equation simplifies to

$$ \frac{S_{\rm tot}}{\rm Jy\,Hz} = \frac{1}{49.8}\, \frac{M_{\rm HI}}{M_\odot} \left(\frac{\rm Mpc}{d_L}\right)^{2} . \qquad (4) $$

In the case of a boxcar line profile, $S_{\rm tot} = S_\nu \Delta\nu$,
where $\Delta\nu$ is in the observed frame of the galaxy. Note
that a given rest-frame frequency or velocity width is
measured by the observer to be $(1 + z)$ narrower (i.e. a
galaxy will span fewer observed channels at higher
redshifts for the same intrinsic rotation width). This
becomes important when calculating the signal-to-noise
of the observation in the following section.

We emphasise that this formula implicitly assumes
an optically thin approximation for the HI, i.e. that
there is negligible self-absorption of the HI flux in
the galaxies themselves. This is a valid assumption
for massive late-type systems, with mean correction
factors of 19% for Sb, 16% for Sbc & Sc and 4%
for Sa and Sab galaxies found in Zwaan et al. (1997)
after averaging over angle of inclination from correction
factors found in Haynes & Giovanelli (1984).
In Haynes & Giovanelli (1984) smaller dwarf systems
had a larger correction factor; however, the majority of
the WALLABY and DINGO detections lie above these
mass limits. Therefore to a good approximation we can
ignore the effects of inclination on the detectability
of the galaxy in terms of total flux received from the
galaxy. There are observational consequences of the
inclination of the galaxy in terms of the observed velocity
width and the resolving out of extended sources, which
we consider in Section 2.3.3, that are fully consistent
with this optically thin assumption.

Figure 2. In the top panel we show a histogram of the
HI mass - halo mass ratio of all systems in the DINGO
DEEP volume, indicating that our methodology results in
a non-linear relation between HI mass and underlying halo
mass. This is not a surprise as the SAM catalogues have a
cold gas mass that reflects a star formation and merger
history that will all act to create a complex interplay between
the gas phase and the underlying halo mass. In the bottom
panel we plot the ratio of HI - halo mass as a function of
halo mass and find that typically more massive systems are
relatively HI deficient. We have estimated errors on the
points by bootstrapping the haloes but the 68% confidence
limits are within the point size for all but the most massive
datapoint.




2.3.1 Detection Limits 

As detailed in Duffy et al. (2008) and references
therein, the expected thermal noise for a dual polarisation
single beam, single dish telescope is given by

$$ \sigma_{\rm noise} = \sqrt{2}\, \frac{k T_{\rm sys}}{A}\, \frac{1}{\sqrt{\Delta t\, \Delta\nu}} , \qquad (5) $$

for an observing time of $\Delta t$ within a bandwidth $\Delta\nu$,
which is assumed to be the ASKAP frequency resolution
of 18.31 kHz (a velocity width $\Delta V = 3.86$ km s$^{-1}$
at z = 0), where $k = 1380$ Jy m$^2$ K$^{-1}$ is the Boltzmann
constant and $T_{\rm sys}$ is the system temperature.

Thompson (1999) showed that for a dual polarisation
interferometer, the noise is reduced by a further
$\sqrt{2}$ if the area $A$ is taken to be that of a single element,
i.e.:

$$ \sigma_{\rm noise} = \frac{k T_{\rm sys}}{A}\, \frac{1}{\sqrt{\Delta t\, \Delta\nu}} , \qquad (6) $$

so that for a full interferometer with $N(N - 1)/2$ baseline
permutations:

$$ \sigma_{\rm noise} = \sqrt{2}\, \frac{k T_{\rm sys}}{A}\, \frac{1}{\sqrt{N(N - 1)\, \Delta t\, \Delta\nu}} . \qquad (7) $$


The effective area of an ASKAP dish is the geometric
area of a 12 m diameter dish, $a$, reduced
by the aperture efficiency, expected to be $\alpha_{\rm eff} \approx
0.8$ (Johnston et al. 2008). Therefore, Eqn. 7 can be
rewritten:

$$ \sigma_{\rm noise} = \sqrt{2}\, \frac{k T_{\rm sys}}{A_{\rm eff}}\, \frac{1}{\sqrt{\Delta t\, \Delta\nu}} , \qquad (8) $$

where $A_{\rm eff} = \alpha_{\rm eff}\, a \sqrt{N(N - 1)}$. We assume $N = 30$
dishes within the 2-km core since, due to processing
limitations, the full ASKAP array of 36 dishes is
unlikely to be initially available. Note the well-known
similarity between Eqn. 5 and 8 for large N.
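
As a rough illustration of the noise estimate described above, the following minimal Python sketch evaluates the interferometer noise for the 30-dish core. The function names are hypothetical, and the system temperature (50 K) and the 8 hour integration used in the example call are assumed illustrative inputs rather than values taken from this section.

```python
import math

K_BOLTZMANN_JY = 1380.0  # Boltzmann constant in Jy m^2 K^-1, as quoted in the text


def effective_area(n_dishes=30, dish_diameter_m=12.0, aperture_efficiency=0.8):
    """Array effective area: alpha_eff * a * sqrt(N (N - 1)), in m^2.

    `a` is the geometric area of one dish.
    """
    geometric_area = math.pi * (dish_diameter_m / 2.0) ** 2
    return aperture_efficiency * geometric_area * math.sqrt(n_dishes * (n_dishes - 1))


def thermal_noise_jy(t_sys_k, integration_s, channel_hz=18.31e3, **array_kwargs):
    """Thermal noise per channel, sqrt(2) k T_sys / (A_eff sqrt(dt dnu)), in Jy."""
    a_eff = effective_area(**array_kwargs)
    return (math.sqrt(2.0) * K_BOLTZMANN_JY * t_sys_k
            / (a_eff * math.sqrt(integration_s * channel_hz)))


if __name__ == "__main__":
    # T_sys = 50 K and an 8 hour integration are assumptions for illustration only.
    sigma = thermal_noise_jy(t_sys_k=50.0, integration_s=8 * 3600.0)
    print(f"noise per 18.31 kHz channel: {sigma * 1e3:.2f} mJy")
```

Because the noise scales as $1/\sqrt{\Delta t}$, quadrupling the integration time in this sketch halves the noise per channel.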

The flux density limit for an observation, $S_{\rm lim}$,
depends on the required signal-to-noise ratio (S/N):

$$ S_{\rm lim} = (S/N)\, \sigma_{\rm noise} . \qquad (9) $$

A galaxy is less easily detected if its flux is spread
across a number of frequency channels, resulting in
lower flux density. For a boxcar profile of rest-frame
velocity width $W$, a line profile will be spread over
$N_{\rm ch} = (W/\Delta V)/(1 + z)$ channels, where $\Delta V$ is the z = 0
ASKAP velocity resolution of 3.86 km s$^{-1}$. As the
noise is assumed to be Gaussian the uncertainty in our
measurement of the mean flux density is reduced by
the square root of the number of independent channels,
or samples. Hence, for detection:

$$ S_{\rm tot}\ ({\rm Jy\,Hz}) > (S/N)\, \frac{\sigma_{\rm noise}}{\sqrt{N_{\rm ch}}}\, N_{\rm ch}\, \Delta\nu . \qquad (10) $$

We can thus observe a galaxy if it has integrated flux
greater than the integrated noise with a signal-to-noise
cut off

$$ S_{\rm tot}\ ({\rm Jy\,Hz}) > (S/N)\, \sigma_{\rm noise}\, \Delta\nu\, \sqrt{N_{\rm ch}} , \qquad (11) $$

where we note again that the frequency bandwidth $\Delta\nu$
is fixed by the ASKAP correlator to be 18.31 kHz and
that the number of channels an observed object spans
will decrease as $(1 + z)$ due to the redshifting of the
HI line, aside from any additional evolutionary effects.
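
The detection criterion can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function names are hypothetical, the noise per channel is supplied by the caller, and the mass conversion assumes the standard reading of Eqn. 4, $M_{\rm HI}/M_\odot = 49.8\,(d_L/{\rm Mpc})^2\,S_{\rm tot}/({\rm Jy\,Hz})$.

```python
import math

CHANNEL_HZ = 18.31e3   # ASKAP channel width quoted in the text
CHANNEL_KMS = 3.86     # the same channel expressed as a z = 0 velocity width


def n_channels(width_kms, z):
    """Channels spanned by a boxcar profile of rest-frame width W at redshift z."""
    return (width_kms / CHANNEL_KMS) / (1.0 + z)


def integrated_flux_limit(snr, sigma_jy, width_kms, z):
    """Minimum integrated flux (Jy Hz) for detection.

    (S/N) * (sigma / sqrt(N_ch)) * N_ch * dnu  =  (S/N) * sigma * sqrt(N_ch) * dnu
    """
    return snr * sigma_jy * math.sqrt(n_channels(width_kms, z)) * CHANNEL_HZ


def hi_mass_limit(snr, sigma_jy, width_kms, z, d_lum_mpc):
    """Limiting HI mass (solar masses), assuming M_HI = 49.8 d_L^2 S_tot."""
    return 49.8 * d_lum_mpc ** 2 * integrated_flux_limit(snr, sigma_jy, width_kms, z)


if __name__ == "__main__":
    # Illustrative inputs only: a 5 sigma cut, 1.6 mJy noise, W = 200 km/s.
    limit = hi_mass_limit(snr=5.0, sigma_jy=1.6e-3, width_kms=200.0, z=0.05,
                          d_lum_mpc=220.0)
    print(f"limiting HI mass: {limit:.3e} Msun")
```

The luminosity distance is passed in explicitly so that the sketch does not commit to a particular cosmology.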




Figure 3. We consider the number of galaxies distributed
as a function of signal to noise for the DINGO DEEP survey.
The HI masses of the galaxies are converted from the
cold gas mass from the Croton et al. (2006) semi-analytic
model as given by Eqn. 1 and converted into an HI signal
to noise ratio by applying Eqn. 4 and 11. The grey solid
histogram is the initial source population for all galaxies
with cold gas masses above $10^{8.5}\,M_\odot$ (found by Power et al.
2010 to be the resolution limit of the Millennium Simulation).
We then calculate the observed galaxy counts for a
DINGO survey in black dotted, with a special case in red
dashed for galaxies that are spatially resolved, which suffer
a loss of signal as described in Section 2.3.3. Galaxies that are
larger or face-on will be more resolved, although the impact
on numbers for a deep survey like DINGO is marginal. Several
objects lose nearly an order of magnitude in signal-to-noise
as a result of this effect, which may impact kinematic
studies of these objects which demand more robust data
than a simple detection.



2.3.2 Disk diameter 

As considered in Duffy et al. (2012b) a potential issue
when using interferometers with high angular resolution
is the resolving out of galaxies more extended
than the beam. With at least 2 km baselines assumed
for the initial survey phase of ASKAP we will certainly
have to consider extended faint sources. We
now summarise the procedure used in Duffy et al.
(2012b) to address this issue. Although we have
accurate HI galaxy masses in the SAM catalogue we
do not have a well-defined disk size and therefore
utilise an empirically derived relation between this
mass and the observed HI diameter, $D_{\rm HI}$ (defined to
be the region inside which the HI surface density is
greater than $1\,M_\odot\,{\rm pc}^{-2}$). From Broeils & Rhee (1997);
Verheijen & Sancisi (2001) we have

$$ \frac{D_{\rm HI}}{\rm kpc} = \left(\frac{M_{\rm HI}}{M_{\rm norm}}\right)^{\gamma} , \qquad (12) $$

where they find an index of $\gamma = 0.55$ and a normalisation
mass of $M_{\rm norm} = 10^{6.8}\,M_\odot$. We then convert
this diameter to an on-sky angular scale using the
angular diameter distance $d_A(z)$. Note that this relation
is of critical importance to our results and we
are confident that this relationship is well supported as
more recent observations by Noordermeer et al. (2005)
have found identical best fit results to Broeils & Rhee
(1997); Verheijen & Sancisi (2001) for an entirely new
galaxy sample, observed with a different telescope.
Although these works are limited to higher column densities
than might be probed by the deeper HI survey
DINGO we believe that it is still appropriate to use
Eqn. 12 as the majority of the HI signal will be from
the high column density material described by this relation.

The distribution of the typical diameters of the
galaxies found in the various ASKAP HI surveys after
selection effects are taken into account is shown in
Table 3, as well as the fraction of the galaxies in each
survey which are resolved by ASKAP.

To test the sensitivity of our results to this relation
we have refit the Noordermeer et al. (2005) results
for the case of a uniform average surface density of
$1\,M_\odot\,{\rm pc}^{-2}$, i.e. $\gamma = 0.5$, and find a normalising mass of
$M_{\rm norm} = 10^{6.5}\,M_\odot$ with only 10% greater reduced $\chi^2$
than the case where $\gamma$ is unconstrained. In using this
more theoretically motivated diameter-mass relation
our overall number counts, after considering resolving
of extended sources, decrease by less than 1% in the
case of DINGO; typically the detected galaxies are of
order 1 kpc larger than in the case where $\gamma$ is free to vary.
We are therefore confident that this critical assumption
of a fixed HI diameter - mass relation is supported
by the data, and that small variations around
this best fit solution make percent-level modifications
to the numbers quoted in Table 3 and hence can be
ignored. Evolution in the HI diameter - mass relation as
modelled in Obreschkow et al. (2009) was found to be
negligible.
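
The diameter-mass relation and its conversion to an on-sky size can be sketched as follows. This is a minimal illustration with hypothetical function names; the angular diameter distance is passed in by the caller so that no particular cosmology is assumed, and the small-angle approximation is used.

```python
import math

ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi


def hi_diameter_kpc(m_hi_msun, gamma=0.55, log_m_norm=6.8):
    """HI diameter in kpc from the power law (M_HI / M_norm)^gamma.

    Defaults are the quoted best fit (gamma = 0.55, M_norm = 10^6.8 Msun);
    gamma = 0.5 with log_m_norm = 6.5 gives the uniform surface density refit.
    """
    return (m_hi_msun / 10.0 ** log_m_norm) ** gamma


def angular_size_arcsec(diameter_kpc, d_ang_mpc):
    """Small-angle on-sky extent for a given angular diameter distance in Mpc."""
    return diameter_kpc / (d_ang_mpc * 1.0e3) * ARCSEC_PER_RADIAN


if __name__ == "__main__":
    for log_mass in (8.5, 9.5, 10.5):
        free = hi_diameter_kpc(10.0 ** log_mass)
        fixed = hi_diameter_kpc(10.0 ** log_mass, gamma=0.5, log_m_norm=6.5)
        print(f"log M_HI = {log_mass}: {free:.1f} kpc (free), {fixed:.1f} kpc (gamma = 0.5)")
```

Comparing the two parameter sets over the mass range of interest gives a quick feel for how little the fixed-slope refit changes the predicted sizes.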



2.3.3 Telescope resolution 

The angular resolution of ASKAP will be initially limited
to 30" at 21 cm wavelength using the central 2 km
core. Typically, for radio telescopes, the beam area
increases like $\lambda^2 \propto (1 + z)^2$. However, this is not the
case for the ASKAP phased array feeds which have a
roughly fixed covering area as a function of redshift.

The effect of resolution on the detectability of
galaxies depends on the efficiency and nature of
galaxy-finding algorithms. Development of several
algorithms is underway for ASKAP and preliminary
testing has been undertaken by Popping et al. (2012).
For the present purposes, we will make the assumption
that the ASKAP source finders will be able to combine
neighbouring independent pixels of a spatially-resolved
galaxy to improve signal-to-noise ratio in the
same manner as for a spectrally-resolved galaxy (see
Eqn. 10). We therefore assume that we have optimal
spatial smoothing of our sources in the sky plane.

The number of pixels the galaxy spans is approximated
by the ratio of the galaxy on-sky area, $A_{\rm gal}$, and
the beam area, $A_{\rm beam}$, each of which is calculated below.
In practice the beam is convolved with the galaxy,
meaning that even when a galaxy is just unresolved
by the telescope (i.e. covers one pixel) we would still
expect to lose signal. This convolution is represented
by an additional factor of unity added to the number
of beams, reducing the signal-to-noise by $\sqrt{2}$ in this
matched case. Therefore the total flux of the galaxy,
i.e. the left hand side of Eqn. 11, is reduced by the
geometric factor $\sqrt{1 + A_{\rm gal}/A_{\rm beam}}$ (an example of this
is shown in Fig. 3).

To calculate the galaxy area we approximate the
true 3D galaxy profile by an elliptical cylinder with
a spectral profile that is independent of position. We
further assume a random inclination to the observer,
$\theta$, where the case of $\theta = 0$ corresponds to face-on and
$\theta = \pi/2$ to edge-on. The projected area of the galaxy
is $\pi (D_{\rm HI}/2)^2 (B/A)$, where $A$ and $B$ are the major and
minor axes respectively, the ratio of which, $B/A$, is
equal to $\cos(\theta)$, although in calculating this we limit
the smallest axis ratio for spirals to 0.12 in accordance
with Masters et al. (2003).

For the natural Gaussian antenna distributions
described in Staveley-Smith (2006) and modelled
in Gupta et al. (2008), the Full-Width Half
Maximum beam extent for ASKAP is $\Omega_{\rm FWHM} \approx
1.4\lambda/2000$ m. The beam area, $A_{\rm beam}$, is therefore given
by $\pi \Omega_{\rm FWHM}^2/(4 \ln 2)$, which we compare to the area of
the galaxy (which has been multiplied by the square
of the cosmological angular diameter distance, i.e. we
have included $(1 + z)^4$ dimming).
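
The resolution penalty described in this section can be sketched as below. This is a minimal illustration, not the survey pipeline: function names are hypothetical, the galaxy is treated in the small-angle limit with its physical diameter and angular diameter distance supplied by the caller, and the beam is evaluated at the rest wavelength of the 21-cm line by default.

```python
import math

REST_WAVELENGTH_M = 299792458.0 / 1.42040575e9  # 21-cm line rest wavelength in metres


def beam_fwhm_rad(wavelength_m=REST_WAVELENGTH_M, baseline_m=2000.0):
    """FWHM beam extent, 1.4 lambda / baseline, in radians."""
    return 1.4 * wavelength_m / baseline_m


def beam_area_sr(fwhm_rad):
    """Gaussian beam area, pi FWHM^2 / (4 ln 2), in steradians."""
    return math.pi * fwhm_rad ** 2 / (4.0 * math.log(2.0))


def galaxy_area_sr(d_hi_kpc, d_ang_mpc, inclination_rad, min_axis_ratio=0.12):
    """Projected on-sky area pi (D/2)^2 (B/A), with B/A = cos(inclination).

    The axis ratio is floored at 0.12 for edge-on spirals, as in the text.
    """
    axis_ratio = max(math.cos(inclination_rad), min_axis_ratio)
    angular_diameter = d_hi_kpc / (d_ang_mpc * 1.0e3)  # radians, small-angle limit
    return math.pi * (angular_diameter / 2.0) ** 2 * axis_ratio


def signal_reduction(area_gal_sr, area_beam_sr):
    """Geometric factor sqrt(1 + A_gal / A_beam) by which the flux is reduced."""
    return math.sqrt(1.0 + area_gal_sr / area_beam_sr)


if __name__ == "__main__":
    beam = beam_area_sr(beam_fwhm_rad())
    # Illustrative galaxy: 40 kpc HI disk at d_A = 200 Mpc, inclined by 60 degrees.
    galaxy = galaxy_area_sr(40.0, 200.0, math.radians(60.0))
    print(f"beam FWHM: {math.degrees(beam_fwhm_rad()) * 3600.0:.1f} arcsec")
    print(f"signal-to-noise reduced by a factor {signal_reduction(galaxy, beam):.2f}")
```

A point source gives a factor of one and a galaxy that exactly fills the beam gives $\sqrt{2}$, matching the matched case described above.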

For the current design of ASKAP, with a 2 km
baseline, this results in 12.5% of galaxies in WALLABY
being resolved out. This effect is particularly
an issue at the high-z end of the survey where galaxies
are faint (see Fig. 3). Overall, despite being more
spatially resolved, face-on galaxies are easier to detect
in the ASKAP surveys considered here. This is shown
in Fig. 5, where a small excess of face-on systems
is recovered from the DINGO survey relative to the
input uniform cosine distribution.

Figure 4. We calculate the observed galaxy counts for a
WALLABY survey. The black dotted histogram represents
the detections possible if all the flux of a galaxy was recovered,
while the red dashed histogram is the situation if extended
galaxies are resolved out. This last case is described
in Section 2.3.3. Larger galaxies, or those face-on, will be
resolved by more synthesised beams and hence lose signal,
with WALLABY losing nearly 15% of galaxies through this
effect.

We note that a natural antenna weighting would 
result in a 20% increase in flux sensitivity for the 
high resolution case of ASKAP, with the additional six 
dishes being used. However it is still undecided which 
weighting scheme will be employed and hence we con- 
servatively assume the same flux limit for both the 
high and low resolution ASKAP images. 



2.3.4 Spectral resolution 

We calculate the velocity width, $W$, of the galaxy from
the intrinsic linewidth, $W_e$, for a given angle
of inclination $\theta$. The intrinsic linewidth of a galaxy,
corrected for broadening, has been shown empirically
to be related to the HI mass by (Briggs & Rao 1993;
Lang et al. 2003)

$$ W_e = 420\ {\rm km\,s^{-1}} \left(\frac{M_{\rm HI}}{10^{10}\,M_\odot}\right)^{0.3} , \qquad (13) $$


although we note that this relation shows a large dispersion,
especially for dwarf galaxies. We have tested
the consequences of this scatter by including an observed
0.2 dex lognormal dispersion in the velocities
and find that 2% more galaxies are detected
(we henceforth ignore this negligible factor but note
that for Tully-Fisher studies it will have to be modelled).
The linewidth of a galaxy, $W_\theta$, which subtends
an angle $\theta$ between its spin axis and the line-of-sight
can be computed using the Tully-Fouqué rotation
scheme (Tully & Fouqué 1985)

$$ (W_e \sin\theta)^2 = W_\theta^2 + V_o^2 - 2 W_\theta V_o \left[1 - e^{-(W_\theta/V_c)^2}\right] - 2 V_o^2\, e^{-(W_\theta/V_c)^2} , \qquad (14) $$




where $V_c = 120$ km s$^{-1}$ represents an intermediate
transition between the small galaxies with Gaussian
HI profiles in which the velocity contributions add
quadratically and giant galaxies with a 'boxy' profile
reproduced by the linear addition of the velocity
terms. $V_o \approx 20$ km s$^{-1}$ is the velocity width due to
random motions in the disk (Rhee & van Albada 1996;
Verheijen & Sancisi 2001).

In cases where $W_\theta \gg V_c$, one can see that $W_\theta =
V_o + W_e \sin\theta$. For $\theta = 0$, one finds that $W_\theta = V_o$, in
other words the HI dispersion in the disk, whereas for
$\theta = \pi/2$ we recover $W_\theta = V_o + W_e$ as expected.
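
Because the Tully-Fouqué relation gives the observed linewidth only implicitly, it has to be solved numerically. The sketch below does this by bisection; it is a minimal illustration with a hypothetical function name, taking the intrinsic linewidth as an input and assuming the transition and random-motion velocities of 120 and 20 km/s quoted above. The bracket follows from the relation itself: the residual is non-positive at the random-motion width and non-negative at the linear sum of the two velocity terms.

```python
import math


def observed_linewidth(w_e_kms, inclination_rad, v_c=120.0, v_o=20.0, tol=1e-9):
    """Solve the Tully-Fouque relation for the observed linewidth by bisection.

    Finds W such that
        (W_e sin i)^2 = W^2 + V_o^2 - 2 W V_o (1 - exp(-(W/V_c)^2))
                        - 2 V_o^2 exp(-(W/V_c)^2)
    """
    projected = abs(w_e_kms * math.sin(inclination_rad))

    def residual(w):
        damping = math.exp(-(w / v_c) ** 2)
        return (w ** 2 + v_o ** 2 - 2.0 * w * v_o * (1.0 - damping)
                - 2.0 * v_o ** 2 * damping - projected ** 2)

    # The root lies between pure random motions and the linear ('boxy') sum.
    low, high = v_o, v_o + projected
    while high - low > tol:
        mid = 0.5 * (low + high)
        if residual(mid) > 0.0:
            high = mid
        else:
            low = mid
    return 0.5 * (low + high)


if __name__ == "__main__":
    for degrees in (0.0, 30.0, 60.0, 90.0):
        width = observed_linewidth(300.0, math.radians(degrees))
        print(f"inclination {degrees:4.0f} deg: observed width {width:.1f} km/s")
```

For a face-on galaxy the loop is skipped and the random-motion width is returned, while for broad edge-on profiles the result approaches the linear sum, in line with the limiting cases noted in the text.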

The small additional broadening effect on the
HI profile due to the frequency resolution of the instrument
(Bottinelli et al. 1990) is negligible for ASKAP,
and is ignored. Therefore for each galaxy we can
uniquely assign an observed linewidth and angular extent
on the sky for a random angle of inclination and
its HI mass (see Fig. 5).
Figure 5. We demonstrate in the grey solid histogram the
input galaxy angles of inclination $\theta$ in the DINGO DEEP
survey, with an assumed random, uniform distribution in
$\cos\theta$. After applying the resolution effects and flux sensitivity
for this survey we see that there is a preference for
galaxies which are face-on so that the signal is distributed
over the fewest velocity channels, even though this angular
extent increases the loss of signal through incoherent addition
of the galaxy flux in the plane of the sky by a realistic
source finder. The exact mean (median) $\theta$ value in degrees
on input for this catalogue is 57.3° (60.0°) and after observation
the preference for face-on systems is clear as $\theta$
becomes 52.8° (54.9°).



3 WALLABY 

With the large Field-of-View of ASKAP it becomes
feasible to scan the entire sky, in 8 hour integrations,
within several years, resulting in significant numbers
of galaxies detected, ~ 0.6 million as given in Table 4.

The effects of resolution on galaxy numbers, as
shown in the high resolution case in Table 4, cause a sharp
drop in the detection rates. Nearly 0.3 million small,
faint detections would now drop below the signal-to-noise
threshold of the survey due to the resolving out of
structure. As shown in Fig. 3, galaxies are lost throughout
the entire redshift range of the survey, including
marginally resolved systems at high redshift which now
drop out of the survey entirely.

The creation of high resolution images of low resolution
detections should clearly be limited to only the
brightest sources. The exact size and signal-to-noise
threshold above which a detection should be rescanned is,
however, entirely dependent on the performance of the
source finder.

Furthermore source confusion, where multiple
HI sources overlap, is almost never an issue for WALLABY
(we direct the reader to Section 4.1 where this
technique is studied in the context of DINGO), in good
agreement with our analytic estimates in Duffy et al.
(2012b). Additionally we find that for WALLABY detections
we can uniquely identify optical counterparts
from a given optical photometric (spectroscopic) redshift
catalogue to better than 97% (99%), as considered
in Section [O] for the case of DINGO.

Although the proposed WALLABY survey extends
to z = 0.26 the majority of galaxies detected
are at much lower redshifts ($\langle z \rangle \approx 0.05$) and hence
will have large angular extents on the sky, which can
be expected to be well resolved by the 2 km baselines of
ASKAP. The distribution of the detections as a function
of proper physical diameter, for different redshift
slices, is considered in the top panel of Fig. 6, with the
on-sky angular extents of these systems in the middle
panel and finally the number of ASKAP beams that
these systems are resolved by in the bottom panel.

There are two interesting results from the top
panel. The first is that the galaxies detected in a flux
limited survey are progressively larger in diameter with
increasing redshift (the solid vertical lines indicate the
mean size of the sample) as this scales with HI mass and
hence flux. The second is that the WALLABY survey will
detect orders of magnitude more galaxies within z = 0.1 than
are detected in the redshift range beyond.

In the middle panel we consider the angular extent 
of all simulated galaxies in the WALLABY survey vol- 
ume (in grey solid histogram) and the detected subset 
with and without resolution effects in black dotted and 
red dashed respectively. This subset is typically several 
arcseconds in extent. The cumulative detection given 
by the red curve shows that at least 50% of detections 
are larger than 30". The blue vertical line indicates 
the resolution at 21cm of the 2km baseline of ASKAP; 
the intersection of this line with the cumulative curve 
indicates that the majority of ASKAP detections
are resolved.

In the bottom panel of Fig. 6 we bin the galaxies
by the number of ASKAP synthesised beams spanning
the object. Although most (87.5%) galaxies are only
'marginally' resolved with one or two beams there are
at least 5% of all detections resolved by 3 beams (as
given by the red curve) and 1% of detections resolved
with 5 beams. A non-negligible number of sources are
extremely well resolved with over 30 beams across the
galaxy.

The lightcone distribution of these sources, in RA
and redshift, is given in Fig. A1 with the cosmic web
clearly visible across the volume surveyed, indicating
that a wide range of environments, from clusters to
voids, will be probed by the survey. This will in turn
allow the measurement of the HI mass function as a
function of environment, as well as various galaxy formation
properties such as the stripping of star-forming
gas within the hot haloes of clusters (e.g. Kilborn et al.
2009).

We now consider the type of galaxies that the
WALLABY sample will contain in Fig. 7. This represents
the true strength of using complex simulations to
predict the galaxy catalogue, as one can quantify the
underlying galaxy population that a blind HI survey
will probe. We consider several key galaxy properties;
from top to bottom panels these are the total halo
mass, HI mass, stellar mass and velocity width of the
galaxies. From this figure it is clear that WALLABY
can expect to detect systems ranging over 4 orders of
magnitude in halo virial mass and 7 orders of magnitude
in stellar mass. The velocity width is approximately
proportional to the square root of the mass enclosed
and hence has a smaller dynamic range, but we
can expect to detect systems with velocities ranging
from 20 km s$^{-1}$ to 1000 km s$^{-1}$. This broad selection
of galaxies will be a unique resource to astronomers
interested in galaxy formation, as well as offering powerful
constraints on the nature of dark matter through
structural analysis using the velocity widths of these
systems.

Figure 6. We consider the sizes of the galaxies as a function
of intrinsic physical diameter as estimated using Eqn. 12
for several redshift cuts (top), angular extent on the sky
(middle) and the number of beams that the galaxies will
be resolved by using ASKAP (bottom) for the WALLABY
survey.


4 DINGO 

A great strength of ASKAP is the possibility of performing
deep HI surveys to provide cosmologically representative
samples of HI rich galaxies beyond the local
universe (> 100k galaxies out to z = 0.43, as given in
Table 4). The deeper HI surveys with ASKAP, termed
DINGO, face a unique challenge relative to the low
redshift WALLABY survey, namely the greater risk
of confusion of HI sources at the increased distances
which DINGO will probe. We were able to ignore the
issue of confusion for WALLABY (and have indeed checked
that this assumption is valid) but must consider it for
the deeper surveys before estimating the galaxy catalogues
that DINGO will likely produce.



4.1 Source Confusion 

Figure 7. In these figures we consider the underlying
galaxy distribution, from top to bottom, as a function of
total halo mass, HI mass, stellar mass and velocity width
for a blind HI survey, WALLABY. As a result of the wide
range in HI - halo mass ratios probed (Fig. 2) we expect the
ASKAP surveys to probe stellar and halo masses over 4 orders
of magnitude, as well as a wide dynamic range of halo velocity
widths. We repeat this figure for the case of DINGO DEEP
in Fig. 10.

A potential issue in any galaxy survey is the ability
to uniquely identify detections rather than artificially
blending overlapping sources, in the sky plane
and along the line of sight, as one object. In previous
work (Duffy et al. 2012b) we argued with simple analytic
calculations that the incidence of confusion was
slight when one had a spectroscopic survey. With our
galaxy catalogue we can make a much improved estimate
by utilising the actual on-sky angular diameters
and velocity widths of the galaxies, as well as the realistic
clustering of sources in space, to determine whether
they overlap, as well as considering the improvements
afforded by imaging with the 6 km baselines of ASKAP.
In Fig. 8 we consider the confusion rate, defined
as the number of galaxies that overlap in angular projection
(minimum on-sky angle is set by the telescope
resolution, 30" or 10") and along the line of sight, divided
by the total number of galaxies within a redshift
slice $\Delta z = 0.02$. We note that we have an effective
mass limit for this comparison of $M_{\rm HI}^{\rm lim} = 10^{8.5}\,M_\odot$ at
all times. If we considered galaxies below this HI mass
limit, the confusion rate would naturally increase, yet
little additional mass would be spuriously attributed
to the main object$^7$ relative to a typical detection in DINGO
DEEP or UDEEP, which is over an order of magnitude
more massive than this lower limit. The three curves in
this plot are, respectively: the confusion rate between
all galaxies above the lower HI mass limit, relative
to the total number of galaxies in the survey volume
(termed 'All-All'); the 'Detect-All' case, which considers
the number of galaxies detected by DINGO
DEEP or UDEEP that also overlap with
any galaxy above $M_{\rm HI}^{\rm lim}$, normalised by the number of
detected galaxies; and finally 'Detect-Detect', which is
the self-confusion rate between detected galaxies.
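
The three confusion rates can be illustrated with a brute-force pairwise test. The sketch below is a minimal illustration and not the procedure used to produce Fig. 8: the `Galaxy` record, the function names, the flat-sky separation and the exact overlap criteria (sum of angular radii on the sky, sum of half-widths in velocity) are all assumptions made here for concreteness.

```python
import math
from collections import namedtuple

# Hypothetical catalogue record; field names are illustrative only.
Galaxy = namedtuple("Galaxy", "ra_deg dec_deg z diameter_arcsec width_kms")

C_KMS = 299792.458


def angular_separation_arcsec(a, b):
    """Flat-sky separation, adequate for the arcsecond scales of interest."""
    mean_dec = math.radians(0.5 * (a.dec_deg + b.dec_deg))
    d_ra = (a.ra_deg - b.ra_deg) * math.cos(mean_dec)
    return math.hypot(d_ra, a.dec_deg - b.dec_deg) * 3600.0


def overlaps(a, b, resolution_arcsec):
    """Assumed criterion: overlap on the sky and along the line of sight."""
    # Unresolved galaxies take the telescope resolution as their diameter.
    size_a = max(a.diameter_arcsec, resolution_arcsec)
    size_b = max(b.diameter_arcsec, resolution_arcsec)
    on_sky = angular_separation_arcsec(a, b) < 0.5 * (size_a + size_b)
    delta_v = C_KMS * abs(a.z - b.z) / (1.0 + 0.5 * (a.z + b.z))
    along_sight = delta_v < 0.5 * (a.width_kms + b.width_kms)
    return on_sky and along_sight


def confusion_rate(targets, field, resolution_arcsec=30.0):
    """Fraction of `targets` overlapping at least one other galaxy in `field`."""
    if not targets:
        return 0.0
    confused = sum(
        1 for t in targets
        if any(g is not t and overlaps(t, g, resolution_arcsec) for g in field)
    )
    return confused / len(targets)
```

With `everything` holding all galaxies in one redshift slice above the mass limit and `detected` the detected subset, `confusion_rate(everything, everything)` corresponds to 'All-All', `confusion_rate(detected, everything)` to 'Detect-All' and `confusion_rate(detected, detected)` to 'Detect-Detect'. The quadratic cost is acceptable for a single slice; a spatial index would be needed for a full catalogue.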

The DINGO DEEP survey is considered in the left
column of Fig. 8 for a resolution of 30" (10") in the
top (bottom) panel. This is repeated for the DINGO
UDEEP survey in the right-hand column. Immediately
apparent is that the confusion rate increases as
a function of redshift as the number density of sources
within a typical beam increases. The second point is
that the confusion rate when one considers a detected
subsample, 'Detect-All', is typically ~ 0.3 dex higher
than the 'All-All' confusion rate of all galaxies above
$M_{\rm HI}^{\rm lim}$.

This is at first a surprising result as HI
sources are often seen as the most weakly clustered
population (e.g. Meyer et al. 2007; Martin et al. 2012)
and hence the 'Detect' sample, which will typically
be more massive than 'All', should be just as confused
as 'All'. The 0.3 dex increase is in fact a combination
of three possible effects.

The first is that more massive objects do lie within
denser environments, and while H I is a weakly
correlated tracer there are tentative results that
overdensities are still detectable over 'field'
galaxies, as shown for the case of the Fornax cluster in
Waugh et al. (2002). The second factor is that the
semi-analytic model of Croton et al. (2006) did not
model proximity effects for infalling objects within
overdense regions through mechanisms such as gas
stripping, harassment or starvation. We should therefore
expect a stronger clustering signal in the model than
would otherwise be the case, making our conclusions
about confusion limits for the ASKAP surveys a
conservative case.

The final factor is numerical in nature and arises
because we can only track objects down to
$M_{\rm HI}^{\rm lim} = 10^{8.5}\,M_\odot$. This means that
although massive galaxies in our catalogue have resolved
satellites, objects at this limit

7 Provided the H I mass function has a faint-end slope
sufficiently shallow that the total mass in systems
below a given threshold mass converges, i.e.
$\alpha > -2$. Zwaan et al. (2005) find
$\alpha = -1.37 \pm 0.03 \pm 0.05$, measured down to
$M_{\rm HI} \sim 10^{7}\,M_\odot$, which is sufficiently
shallow for our conclusion to stand.
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Figure 8. We investigate the confusion rates of overlapping galaxies in both the DINGO DEEP and UDEEP surveys (left 
and right columns respectively) with two possible dish configurations for ASKAP: a core design with only 2 km baselines
and an extended 6 km case (giving 30" and 10" resolution, shown in the top and bottom rows respectively). To calculate the
incidence rate of galaxies overlapping in redshift slices of width $\Delta z = 0.02$ we utilised the angular extent of the galaxies on
the sky (if unresolved, the angular diameter was fixed to be the telescope resolution) together with the velocity widths along
the line of sight. We consider two cases of confusion: between all galaxies above a limiting H I mass $M_{\rm HI}^{\rm lim} = 10^{8.5}\,M_\odot$, 'All-All'
(black triangles), and the confusion rate of the detected galaxies in the survey with any galaxies, 'Detect-All' (red diamonds).
The former case is normalised by the number of galaxies in a redshift slice, the latter by the number of detections in the
same slice. Errors are Poissonian. The largest confusion rate is seen at the survey edge of the deepest survey, DINGO
UDEEP (top right panel), where it reaches ~10% if only 30" resolution is attainable. If a strategy of follow-up observations
at 10" is adopted then this rate becomes negligible (< 3%). Intriguingly, we note that the detected sources are more often
confused than the average rate; this is discussed in the text.



won't have their own satellites. We can test the severity
of this issue by choosing a mass cut of
$M_{\rm HI}^{\rm lim} = 10^{9}\,M_\odot$ and re-running the
analysis. In this case the difference is lessened (as
might be expected, since proportionally more galaxies
above this limit in the volume are detected) but only by
0.1 dex, leaving the remaining discrepancy as a likely
combination of the first two explanations.

Within the entire DINGO DEEP sample of detections,
766 (350) galaxies, or a rate of 1.4% (0.7%), are
confused with galaxies of mass greater than
$M_{\rm HI}^{\rm lim} = 10^{8.5}\,M_\odot$ with the ASKAP
30" (10") beam configuration. The number of detections
that are confused with other detections, i.e.
self-confused, is of course far lower, with only 62 (34)
galaxies, or a rate of 0.11% (0.07%), for the ASKAP 30"
(10") beam configuration. With a deeper galaxy sample
one might expect the DINGO UDEEP survey to suffer a
greater incidence of confusion at a fixed telescope beam
resolution, and indeed 3971 (1015) galaxies, a rate of
4.9% (1.3%), suffer confusion for the 30" (10") ASKAP
beam. The self-confusion incidence in UDEEP is also
slightly higher than in DEEP, with 406 (118) galaxies
overlapping with other detected galaxies, a rate of
0.5% (0.15%).
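
As a simple consistency check on the numbers quoted above, each confused-galaxy count divided by its percentage rate should return roughly the same total number of detections for a given survey. The sketch below (function and dictionary names are illustrative, not from the paper) performs that arithmetic; because the quoted rates are rounded, the implied totals only agree to within rounding, at roughly $5 \times 10^{4}$ detections for DEEP and $8 \times 10^{4}$ for UDEEP.

```python
def implied_sample_size(n_confused, rate_percent):
    """Number of detections implied by a confused count and its quoted rate."""
    return n_confused / (rate_percent / 100.0)


# (survey, beam, case) -> (confused galaxies, quoted rate in percent),
# copied from the text above.
quoted = {
    ("DEEP", '30"', "Detect-All"): (766, 1.4),
    ("DEEP", '10"', "Detect-All"): (350, 0.7),
    ("DEEP", '30"', "Detect-Detect"): (62, 0.11),
    ("DEEP", '10"', "Detect-Detect"): (34, 0.07),
    ("UDEEP", '30"', "Detect-All"): (3971, 4.9),
    ("UDEEP", '10"', "Detect-All"): (1015, 1.3),
    ("UDEEP", '30"', "Detect-Detect"): (406, 0.5),
    ("UDEEP", '10"', "Detect-Detect"): (118, 0.15),
}

for key, (count, rate) in quoted.items():
    print(key, round(implied_sample_size(count, rate)))
```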

4.2 H I detections

Given the previous result that the overall number
counts in DINGO will be only slightly affected by
source confusion, we can consider the distribution of
the detections as a function of redshift in Fig. 9 for
the DEEP and UDEEP components of DINGO. We see
immediately that the surveys complement each other, with
the increased observing time over smaller areas enabling
the UDEEP part of DINGO to extend the H I detections to
z = 0.43. Note that there are several orders of
magnitude more galaxies left undetected in the survey
volume above $M_{\rm HI} = 10^{8.5}\,M_\odot$. These can
be stacked to create a measurable signal from individ- 
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Figure 9. The predicted galaxy number counts for the 
surveys DINGO DEEP and UDEEP (top and bottom,
respectively). The grey solid histograms represent the
initial galaxy catalogue and the red dashed histograms
represent source finders that only incoherently sum the
signal from extended sources, and hence give a more
conservative estimate of the ultimate telescope
performance. These figures demonstrate the strength of
ASKAP in probing the evolution of H I out to z = 0.26
and 0.43 in the DINGO DEEP and UDEEP surveys,
respectively.
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ually undetected sources (after cross-correlating with 
optical surveys; the issue of identifying these galaxies
is considered in Section 4.3).

The properties of the detected galaxies in DINGO
are similar to those probed by WALLABY, with the
caveat that the simulations only contain galaxies with
$M_{\rm HI} \geq 10^{8.5}\,M_\odot$ and hence the likely
low-mass systems in the latter are missing. In Fig. 10
we consider the total halo mass, H I mass, stellar mass
and intrinsic rotational width of the sample of galaxies
detected by DINGO DEEP. The detections span 4 orders of
magnitude in total mass, 7 orders of magnitude in
stellar mass and nearly 2 orders of magnitude in
velocity width.

A significant science case for large-area H I surveys
is to measure the H I mass function. As shown in
Fig. 11, the ASKAP telescope will create a complete
mass function above $M_{\rm HI} \approx 10^{10}\,M_\odot$
with the DINGO




Figure 10. In these figures we consider the underlying
galaxy distribution, from top to bottom, as a function of
total halo mass, H I mass, stellar mass and velocity width
(plotted as log10(W [km/s])) for a deep H I survey, DINGO
DEEP. The range of properties probed is similar to
WALLABY, given in Fig. 7 previously.
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Figure 11. The blue line is the observed H I mass
function from ALFALFA (Martin et al. 2010), with errors
represented by the thickness of the line, and in grey is
the H I mass function from the simulation volume, with
Poissonian errors given by the shaded region. We then
perform an observation using the DINGO DEEP survey
parameters and plot the resultant flux-limited mass
distribution recovered as the red squares. Error bars are
Poissonian. DINGO DEEP is complete for galaxies above the
'knee', $M_{\rm HI} \gtrsim 10^{10}\,M_\odot$, allowing us
to measure evolution in the high-mass end of the H I mass
function over the entire redshift range of the survey,
and to progressively lower redshift ranges for lower
masses.



survey. This will enable the measurement of evolution 
in the high mass end of the mass function across the 
entire redshift range, and to progressively lower masses 
for lower redshift ranges. 

We now consider the physical extent of the sources
detected by DINGO in Fig. 12. These are typically of
smaller physical extent (top panel) at a given redshift
than the corresponding WALLABY sources, as a result of
the greater integration time per field, which allows
fainter (and hence physically smaller) galaxies to be
detected. As shown in the middle panel of Fig. 12, the
majority of galaxies are unresolved (compare the
intersection of the cumulative red curve with the ASKAP
30" beam, denoted by the vertical blue line). If we
create a histogram of the number of synthesised beams by
which the galaxies are resolved (bottom panel), we see
that only a small fraction of galaxies are resolved by
more than one beam (DEEP and UDEEP resolve <9% and 1% of
galaxies, respectively). We also consider the case of
observing these systems with the 10" synthesised beam
available when using 6 km baselines, as shown in
Table 3, and find that 95% of objects are then
marginally resolved. Although not a key science case for
DINGO, there could be as many as 100 galaxies resolved
by more than 10 beams, which will be a valuable dataset
to complement the WALLABY sample.
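
To give a feel for why most DINGO sources are unresolved at 30" yet marginally resolved at 10", the following sketch converts a physical diameter into a number of beams across the source. It is an illustration only: the flat $\Lambda$CDM parameters ($\Omega_m = 0.25$, $h = 0.73$) are assumed values typical of the simulations underlying such semi-analytic models rather than numbers quoted in this section, and the 60 kpc diameter in the usage note is hypothetical.

```python
import numpy as np

C_KMS = 299792.458  # speed of light in km/s


def angular_diameter_distance_mpc(z, h=0.73, omega_m=0.25, n_steps=2048):
    """Angular diameter distance in Mpc for an assumed flat LambdaCDM model."""
    zs = np.linspace(0.0, z, n_steps)
    inv_e = 1.0 / np.sqrt(omega_m * (1.0 + zs) ** 3 + (1.0 - omega_m))
    # Trapezoidal integration of 1/E(z) from 0 to z.
    integral = np.sum(0.5 * (inv_e[1:] + inv_e[:-1]) * np.diff(zs))
    comoving = (C_KMS / (100.0 * h)) * integral
    return comoving / (1.0 + z)


def beams_across(diam_kpc, z, beam_arcsec=30.0):
    """Number of synthesised beams spanning a galaxy of given physical size."""
    d_a_kpc = angular_diameter_distance_mpc(z) * 1.0e3
    # Small-angle approximation: theta [rad] = size / distance.
    theta_arcsec = np.degrees(diam_kpc / d_a_kpc) * 3600.0
    return theta_arcsec / beam_arcsec
```

For example, `beams_across(60.0, 0.26, beam_arcsec=30.0)` and `beams_across(60.0, 0.26, beam_arcsec=10.0)` differ by exactly a factor of three, which is the gain in linear resolution from the 6 km baselines.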

We illustrate the strength of DINGO as a deep
survey instrument in Fig. A1, in which the large-scale
structure is clearly visible out to the edge of the survey
at z ~ 0.43, allowing the H I properties of galaxies to be
studied as a function of both environment and redshift.
These lightcones are presented as one contiguous field
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Figure 12. In these figures we consider the sizes of the 
galaxies as a function of intrinsic physical diameter, as
estimated using Eqn. 12, for several redshift cuts (top),
angular extent on the sky (middle) and the number of
beams by which the galaxies will be resolved (bottom
panel) for the proposed DINGO DEEP survey. For the case
of DINGO UDEEP the majority of galaxies, even with the
resolution afforded by the ASKAP core configuration, will
be only marginally resolved in angular extent at best;
hence we do not consider it here.
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while in reality the DINGO fields are distributed across 
the sky, thereby reducing the effects of cosmic variance. 



4.3 Matching Optical Catalogues 

We can use our catalogue to estimate typical
misidentification rates when uniquely assigning optical
counterparts to our H I detections. This measurement
differs slightly from the confusion rate discussed
before. For the case where there are, say, two (three)
possible counterparts, one can always blindly choose one
as the optical counterpart; statistically speaking, we
will then be correct 50% (33%) of the time. If one
considers priors such as the size of these systems, the
assignment can be substantially more reliable than
simple blind guessing. The misidentification rate is
therefore slightly lower than the confusion rate,
because even a random assignment from the list of
possible counterparts is sometimes correct.
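
The distinction between the confusion rate and the blind-guess misidentification rate can be made concrete with a small sketch. The helper names and the toy list of candidate counts are hypothetical; the only input is the number of plausible optical counterparts per H I detection, and no priors (such as galaxy size) are used.

```python
def confusion_fraction(n_candidates):
    """Fraction of detections with more than one possible counterpart."""
    return sum(1 for n in n_candidates if n > 1) / len(n_candidates)


def blind_misidentification_rate(n_candidates):
    """Expected misidentification rate when one counterpart is picked at random.

    A detection with n equally likely candidates is assigned wrongly with
    probability 1 - 1/n, so detections with a single candidate contribute 0.
    """
    return sum(1.0 - 1.0 / n for n in n_candidates) / len(n_candidates)


# Toy example: two unambiguous detections, one with two candidates and
# one with three candidates.
toy = [1, 1, 2, 3]
print(confusion_fraction(toy), blind_misidentification_rate(toy))
```

In this toy case half of the detections are confused, but fewer than half are expected to be misidentified, in line with the argument above.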

In Duffy et al. (2012b) we considered two optical
catalogues, one with photometric redshifts (a typical
redshift error for such galaxies is $\Delta z = 0.05$, as
argued by Hildebrandt et al. 2008) and one with
spectroscopic redshifts. We estimated that in the latter
case the counterpart becomes uncertain when two galaxy
rotation widths overlap; hence we conservatively assumed
$\Delta z = 0.002$ (twice the width of an M* galaxy, which
is the likely system detected by ASKAP at high
redshift$^{8}$).
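
The spectroscopic window of $\Delta z = 0.002$ can be related to a velocity width with a one-line conversion. The sketch below uses the non-relativistic relation $\Delta z \approx \Delta v / c$ and ignores any $(1+z)$ factor, both of which are simplifying assumptions; the function names are illustrative.

```python
C_KMS = 299792.458  # speed of light in km/s


def redshift_window_from_width(width_kms, factor=2.0):
    """Redshift window equal to `factor` times a velocity width (dz ~ dv/c)."""
    return factor * width_kms / C_KMS


def width_from_redshift_window(dz, factor=2.0):
    """Velocity width (km/s) for which `factor` widths span a window dz."""
    return dz * C_KMS / factor


# A window of dz = 0.002 taken as twice the galaxy width corresponds to a
# width of about 300 km/s under these assumptions.
print(width_from_redshift_window(0.002))
```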



4.3.1 DINGO DEEP 

In Fig. 13 we consider the case for the DINGO DEEP
(UDEEP) survey in the left (right) columns. For both
surveys we consider a resolution of 30" in the top
panel and the high-resolution imaging at 10" in the
bottom panel. For each case we consider an optical
catalogue with only photometric redshifts
($\Delta z = 0.05$) and one with spectroscopic redshifts
($\Delta z = 0.002$), with results given by the red square
and black diamond points respectively. We find that the
H I galaxies surveyed with the 30" ASKAP beam at
z = 0.05 can be uniquely identified with a single
optical counterpart, with photometric errors, more than
96% of the time (top left panel). However, the
misidentification rate rises strongly to ~18% at the
survey edge, z = 0.26. If one has access to a
spectroscopic survey then, even at the edge of the
survey, optical counterparts can be correctly identified
95% of the time.

If one has access to increased resolution at 10" for
DINGO DEEP (bottom left panel) then the
misidentification rate is $\lesssim 3$% for the entirety
of the survey, even when only photo-z catalogues are
available, in agreement with our conclusion in Duffy et
al. (2012b).



8 Although individual velocity widths of the galaxies are
available, and indeed were used in the confusion study of
Section 4.1, we adopt the limiting spectroscopic width in
accordance with previous estimates from Staveley-Smith
(2008) and Duffy et al. (2012b).



4.3.2 DINGO UDEEP 

As we showed for the confusion rates of DINGO
UDEEP (see Section 4.1), it is much harder to separate
galaxies at higher redshift than at low redshift, and
it is therefore unsurprising that the misidentification
rate with an optical catalogue is also much higher
than in the case of the lower redshift DINGO DEEP.

We find that if one only has access to the 30"
DINGO UDEEP source list and attempts to uniquely
identify counterparts from a photometric catalogue
(red squares, top right panel of Fig. 13) then at the
beginning (end) of the survey the misidentification rate
is 7% (31.5%). If using a spectroscopic optical
catalogue (black diamonds, top right panel of Fig. 13)
then the rates are 1-12% across the survey.

As before, if one images the field at 10" (bottom
right panel of Fig. 13) then even with a photometric
catalogue the misidentification rate is never higher
than 6% (reached at z = 0.43), and when combined
with a spectroscopic optical survey this would drop to
of order one percent.

In summary, the rates of confusion for H I galaxies
are negligible with ASKAP provided one has access to
the highest possible resolution of 10" synthesised
beams. Key science goals that rely on the
identification of the optical counterparts to H I
sources can make use of photometric catalogues,
suffering from less than 9% misidentification rates
(even out to z = 0.43 for the case of DINGO UDEEP) if
one has the high resolution imaging.

If ASKAP synthesised beams of 30" are used then 
DINGO DEEP can still identify an optical counter- 
part at z = 0.05 (0.26) using a photo-z catalogue more 
than 96 (82.5)% of the time. For DINGO UDEEP the 
photo-z catalogue will suffer a misidentification rate of 
5 (30)% at the start (end) of the survey. If a spectro- 
scopic catalogue is available then even with 30" reso- 
lution the UDEEP sources will be uniquely identified 
at z = 0.43 88% of the time. 

In conclusion, we find that the required quality of
the optical catalogue depends strongly on the ASKAP
resolution. For DINGO DEEP and UDEEP science goals
that can accommodate misidentification rates of order
5%, a photometric redshift catalogue is sufficient
with 10" resolution imaging from ASKAP. However,
for certain science goals such as H I spectral stacking,
spectroscopic optical catalogues are essential,
irrespective of ASKAP resolution.



5 COMPARISON OF SURVEYS 

We combine the overall galaxy count distribution as a
function of redshift for all three surveys (WALLABY,
DINGO DEEP and UDEEP) in Fig. 14, which clearly
demonstrates the 'wedding cake' design of successively
deeper, overlapping surveys that produce significant
galaxy samples of order $10^{2}$ systems per redshift bin
across the entire redshift range z = 0-0.43 of ASKAP.
The WALLABY survey will be able to find
$\sim 6 \times 10^{5}$ galaxies at S/N = 5, which will be
nearly two orders of magnitude more galaxies than current
all-sky H I surveys have been able to find. We
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Figure 13. We have calculated the misidentification rate of galaxies when assigning an optical counterpart to H I detections
for the 30" and 10" resolutions of ASKAP (top and bottom panels) in the DINGO DEEP and UDEEP surveys (left and right
columns respectively). We considered two typical redshift errors that an optical catalogue would suffer: the first, in red squares,
is a photometric redshift uncertainty in the position of the optical counterpart (as argued by Hildebrandt et al. 2008,
this is typically $\Delta z = 0.05$) and, in black diamonds, we consider a spectroscopic survey with typical uncertainties assumed
to be twice the velocity width of an M* galaxy ($\Delta z = 0.002$), which will be detectable throughout the redshift range of the
survey (Duffy et al. 2012b). The rates are normalised by the number of galaxies within the redshift slice, with Poissonian
errors for each point. With the greater volume set by the ASKAP beam and optical photo-z errors there are more potential
optical counterparts. At the edge of the DINGO UDEEP survey more than 30% of galaxies will be misassigned. If there are
follow-up observations at 10" the misidentification rate drops to ~3% (< 10%) for DINGO DEEP (UDEEP).



have modelled the number counts as a function of S/N
in Table 4, with a smooth drop in galaxy detections
for more demanding thresholds. DINGO may, through
the DEEP and UDEEP tiered approach, detect
$\sim 10^{5}$ galaxies over the redshift range
z = 0-0.43, covering 5 billion years of cosmic evolution.
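
The cosmic time spanned by a survey out to z = 0.43 can be estimated from the standard lookback-time integral, $t_L = H_0^{-1} \int_0^z dz' / [(1+z')E(z')]$. The sketch below assumes a flat $\Lambda$CDM cosmology with $\Omega_m = 0.25$ and $h = 0.73$; these parameter values are assumptions for illustration, not quoted in this section. Under them the result lies between the four billion years quoted in the abstract and the five billion years quoted here.

```python
import numpy as np

KM_PER_MPC = 3.0856775814913673e19  # kilometres in one megaparsec
SEC_PER_GYR = 3.15576e16            # seconds in one gigayear (Julian years)


def lookback_time_gyr(z, h=0.73, omega_m=0.25, n_steps=4096):
    """Lookback time in Gyr to redshift z for an assumed flat LambdaCDM model."""
    zs = np.linspace(0.0, z, n_steps)
    e_z = np.sqrt(omega_m * (1.0 + zs) ** 3 + (1.0 - omega_m))
    integrand = 1.0 / ((1.0 + zs) * e_z)
    # Trapezoidal integration of dz / [(1+z) E(z)] from 0 to z.
    integral = np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(zs))
    hubble_time_gyr = KM_PER_MPC / (100.0 * h) / SEC_PER_GYR
    return hubble_time_gyr * integral


print(lookback_time_gyr(0.26), lookback_time_gyr(0.43))
```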

We also considered a Northern hemisphere H I survey
using the Westerbork array, called WNSHS, which was
midway in flux sensitivity between WALLABY and DINGO
DEEP. This facility can potentially detect
$\sim 2\,(5) \times 10^{5}$ galaxies, dependent on the
depth to which it surveys, making it an incredibly
powerful dataset in its own right. However, if one were
to combine both WALLABY and WNSHS the potential science
increases, with a combined catalogue of over a million
H I sources with integrated signal-to-noise of at least
5, with spectroscopic redshifts and full $4\pi$ sky
coverage. This will enable bulk flow studies and
velocity field probes using a contiguous, blind dataset
which will dramatically improve the power of this
promising probe of cosmology and structure formation.



6 CONCLUSION 

ASKAP is a uniquely powerful telescope, capable of
cataloguing $6 \times 10^{5}$ galaxies in the local
Universe and probing the distribution of H I out to
z = 0.43 with $5-6 \times 10^{4}$ galaxies (dependent on
evolutionary possibilities), creating a more complete
view of H I than ever before. The systems we expect to
find will probe 4 orders of magnitude in total halo mass
and 7 orders of magnitude in stellar mass. This will
greatly aid our understanding of galaxy formation as
well as the nature of dark matter haloes. Furthermore, a
subsample of the nearby detections will be well resolved
by ASKAP, with ~6000 (~700) galaxies resolved by > 5
(> 10) beams, providing a valuable resource for the
dynamical modelling of galaxies in 3D. However, if the
full 6 km baseline for ASKAP is available and the
galaxies are surveyed with 10" resolution then
significantly more systems are well resolved:
$\sim 1.5 \times 10^{5}$ ($2 \times 10^{4}$) galaxies
resolved by > 5 (> 10) beams. This resolved catalogue
can then probe the small-scale clustering of the dark
matter and the influence that the baryons have at such
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Figure 14. The expected galaxy number density in redshift
bins of width $\Delta z = 0.02$ for the DINGO DEEP, UDEEP
and WALLABY surveys. For DINGO UDEEP we take the
conservative case that there is no evolution in the H I
mass function across the redshift range probed, and for
WALLABY we reduce the number of galaxies found by an
order of magnitude for ease of comparison with the
deeper surveys. Note the 'wedding cake' design of these
overlapping surveys, which ensures significant numbers
of galaxies are probed at each redshift.



scales (e.g. Duffy et al. 2010). As well as being a
crucial aid to studying the velocity structure of nearby
objects, the postage-stamp zoom-ins around galaxy
detections are crucial to lowering the confusion rate of
the distant galaxy population to sub-percent levels for
all spectroscopic surveys with ASKAP. The postage-stamps
will also be crucial in making possible unambiguous
assignments of optical counterparts to the H I
detections, a key science case for multi-wavelength
surveys such as GAMA (Driver et al. 2011).

As a community resource we make available cata- 
logues for both surveys with the sky coordinates (RA 
and DEC), redshift and observed redshift (i.e. including
peculiar motion) as well as the stellar, halo, cold
gas and H I mass and finally the velocity width of the
system. This catalogue will be invaluable in guiding 
survey preparations for Tully-Fisher studies, velocity 
field probes as well as correlation function investi- 
gations to best utilise the formidable resource that 
ASKAP will represent for a wide range of astronomical 
fields. 
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Figure A1. The lightcone pie-plot for the shallow all-sky survey WALLABY and the two deeper surveys from DINGO. We
have taken the entire Declination range of the survey and projected it onto a redshift-RA 2D plot. Brightness is based on
the number density of the sources in each map pixel. The cosmic web is clearly visible in this image; a key science driver
of WALLABY is the measurement of the H I mass function in different environments. In this volume are over 0.6 million
galaxy detections, which can be compared with HIPASS and ALFALFA, which had approximately two orders of magnitude
fewer detections. The DINGO survey will probe evolution in the high-mass end of the H I mass function over 4 billion years
of cosmic time. Additionally, DINGO will overlap with existing GAMA fields to enable a wealth of multi-wavelength data
to be used when analysing the H I detections, as well as enabling H I spectral stacking at given optical spectroscopic redshifts
to extend the H I detections. We reiterate that the DINGO fields are not actually contiguous as pictured here, but are in fact
distributed across the sky. For the mock lightcone we made the simplifying assumption that the fields were contiguous; this
had no impact on the final number of galaxies predicted. High-resolution versions of this image and fly-through movies are available at
http://ict.icrar.org/store/Movies/Duffy12c/



